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1 INTRODUCTION 



1 Introduction 

1.1 Major sources 

The textbooks which I have consulted most frequently while developing course 
material are: 

The principles of quantum mechanics, RA.M. Dirac, 4th Edition (revised), (Ox- 
ford University Press, Oxford, UK, 1958). 

The Feynman lectures on physics, R.R Feynman, R.B. Leighton, and M. Sands, 
Volume III (Addison-Wesley Reading MA, 1965). 

Quantum mechanics, E. Merzbacher, 2nd Edition (John Wiley & Sons, New York 
NY, 1970). 

Modern quantum mechanics, J.J. Sakurai, (Benjamin/Cummings, Menlo Park 
CA, 1985). 
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2 Fundamental concepts 

2.1 The breakdown of classical physics 

The necessity for a departure from classical mechanics is clearly demonstrated 
by: 

1. The anomalous stability of atoms and molecules: According to classical physics, 
an electron orbiting a nucleus should lose energy by emission of synchrotron 
radiation, and gradually spiral in towards the nucleus. Experimentally, this 
is not observed to happen. 

2. The anomalously low specific heats of atoms and molecules: According to the 
equipartition theorem of classical physics, each degree of freedom of an 
atomic or molecular system should contribute R/2 to its molar specific heat, 
where R is the ideal gas constant. In fact, only the translational and some 
rotational degrees of freedom seem to contribute. The vibrational degrees 
of freedom appear to make no contribution at all (except at high temper- 
atures). Incidentally, this fundamental problem with classical physics was 
known and appreciated in the middle of the nineteenth century. Stories that 
physicists at the start of the twentieth century thought that classical physics 
explained everything, and that there was nothing left to discover, are largely 
apocryphal (see Feynman, Vol. I, Cha. 40). 

3. The ultraviolet catastrophe: According to classical physics, the energy density 
of an electromagnetic field in vacuum is infinite due to a divergence of en- 
ergy carried by short wave-length modes. Experimentally, there is no such 
divergence, and the total energy density is finite. 

4. Wave-particle duality: Classical physics can deal with waves or particles. How- 
ever, various experiments (e.g., light interference, the photo-electric effect, 
electron diffraction) show quite clearly that waves sometimes act as if they 
were streams of particles, and streams of particles sometimes act as if they 
were waves. This is completely inexplicable within the framework of classi- 
cal physics. 
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2.2 The polarization of photons 

It is known experimentally that when plane polarized light is used to eject photo- 
electrons there is a preferred direction of emission of the electrons. Clearly, the 
polarization properties of light, which are more usually associated with its wave- 
like behaviour, also extend to its particle-like behaviour. In particular, a polariza- 
tion can be ascribed to each individual photon in a beam of light. 

Consider the following well-known experiment. A beam of plane polarized 
light is passed through a polaroid film, which has the property that it is only 
transparent to light whose plane of polarization lies perpendicular to its optic 
axis. Classical electromagnetic wave theory tells us that if the beam is polarized 
perpendicular to the optic axis then all of the light is transmitted, if the beam is 
polarized parallel to the optic axis then none of the light is transmitted, and if the 
light is polarized at an angle oc to the axis then a fraction sin oc of the beam is 
transmitted. Let us try to account for these observations at the individual photon 
level. 

A beam of light which is plane polarized in a certain direction is made up of a 
stream of photons which are each plane polarized in that direction. This picture 
leads to no difficulty if the plane of polarization lies parallel or perpendicular 
to the optic axis of the polaroid. In the former case, none of the photons are 
transmitted, and, in the latter case, all of the photons are transmitted. But, what 
happens in the case of an obliquely polarized incident beam? 

The above question is not very precise. Let us reformulate it as a question 
relating to the result of some experiment which we could perform. Suppose that 
we were to fire a single photon at a polaroid film, and then look to see whether 
or not it emerges from the other side. The possible results of the experiment are 
that either a whole photon, whose energy is equal to the energy of the incident 
photon, is observed, or no photon is observed. Any photon which is transmitted 
though the film must be polarized perpendicular to the optic axis. Furthermore, 
it is impossible to imagine (in physics) finding part of a photon on the other side 
of the film. If we repeat the experiment a great number of times then, on average, 
a fraction sin oc of the photons are transmitted through the film, and a fraction 
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cos 2 oc are absorbed. Thus, we conclude that a photon has a probability sin 2 oc of 
being transmitted as a photon polarized in the plane perpendicular to the optic 
axis, and a probability cos 2 oc of being absorbed. These values for the probabilities 
lead to the correct classical limit for a beam containing a large number of photons. 

Note that we have only been able to preserve the individuality of photons, 
in all cases, by abandoning the determinacy of classical theory, and adopting a 
fundamentally probabilistic approach. We have no way of knowing whether an 
individual obliquely polarized photon is going to be absorbed by or transmitted 
through a polaroid film. We only know the probability of each event occurring. 
This is a fairly sweeping statement, but recall that the state of a photon is fully 
specified once its energy, direction of propagation, and polarization are known. 
If we imagine performing experiments using monochromatic light, normally in- 
cident on a polaroid film, with a particular oblique polarization, then the state of 
each individual photon in the beam is completely specified, and there is nothing 
left over to uniquely determine whether the photon is transmitted or absorbed by 
the film. 

The above discussion about the results of an experiment with a single obliquely 
polarized photon incident on a polaroid film answers all that can be legitimately 
asked about what happens to the photon when it reaches the film. Questions as 
to what decides whether the photon is transmitted or not, or how it changes its 
direction of polarization, are illegitimate, since they do not relate to the outcome 
of a possible experiment. Nevertheless, some further description is needed in 
order to allow the results of this experiment to be correlated with the results of 
other experiments which can be performed using photons. 

The further description provided by quantum mechanics is as follows. It is 
supposed that a photon polarized obliquely to the optic axis can be regarded as 
being partly in a state of polarization parallel to the axis, and partly in a state of 
polarization perpendicular to the axis. In other words, the oblique polarization 
state is some sort of superposition of two states of parallel and perpendicular 
polarization. Since there is nothing special about the orientation of the optic 
axis in our experiment, we must conclude that any state of polarization can be 
regarded as a superposition of two mutually perpendicular states of polarization. 
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When we make the photon encounter a polaroid film, we are subjecting it 
to an observation. In fact, we are observing whether it is polarized parallel or 
perpendicular to the optic axis. The effect of making this observation is to force 
the photon entirely into a state of parallel or perpendicular polarization. In other 
words, the photon has to jump suddenly from being partly in each of these two 
states to being entirely in one or the other of them. Which of the two states it will 
jump into cannot be predicted, but is governed by probability laws. If the photon 
jumps into a state of parallel polarization then it is absorbed. Otherwise, it is 
transmitted. Note that, in this example, the introduction of indeterminacy into 
the problem is clearly connected with the act of observation. In other words, the 
indeterminacy is related to the inevitable disturbance of the system associated 
with the act of observation. 

2.3 The fundamental principles of quantum mechanics 

There is nothing special about the transmission and absorption of photons through 
a polaroid film. Exactly the same conclusions as those outlined above are ob- 
tained by studying other simple experiments, such as the interference of photons 
(see Dirac, Sect. 1.3), and the Stern-Gerlach experiment (see Sakurai, Cha. 1; 
Feynman, Cha. 5). The study of these simple experiments leads us to formulate 
the following fundamental principles of quantum mechanics: 

1. Dirac 's razor: Quantum mechanics can only answer questions regarding the 
outcome of possible experiments. Any other questions lie beyond the realms 
of physics. 

2. The principle of superposition of states: Any microscopic system (i.e., an atom, 
molecule, or particle) in a given state can be regarded as being partly in 
each of two or more other states. In other words, any state can be regarded 
as a superposition of two or more other states. Such superpositions can be 
performed in an infinite number of different ways. 

3. The principle of indeterminacy: An observation made on a microscopic system 
causes it to jump into one or more particular states (which are related to 
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the type of observation). It is impossible to predict into which final state 
a particular system will jump, however the probability of a given system 
jumping into a given final state can be predicted. 

The first of these principles was formulated by quantum physicists (such as Dirac) 
in the 1920s to fend off awkward questions such as "How can a system suddenly 
jump from one state into another?", or "How does a system decide which state to 
jump into?". As we shall see, the second principle is the basis for the mathemat- 
ical formulation of quantum mechanics. The final principle is still rather vague. 
We need to extend it so that we can predict which possible states a system can 
jump into after a particular type of observation, as well as the probability of the 
system making a particular jump. 

2.4 Ket space 

Consider a microscopic system composed of particles or bodies with specific prop- 
erties (mass, moment of inertia, etc.) interacting according to specific laws of 
force. There will be various possible motions of the particles or bodies consistent 
with the laws of force. Let us term each such motion a state of the system. Accord- 
ing to the principle of superposition of states, any given state can be regarded as 
a superposition of two or more other states. Thus, states must be related to math- 
ematical quantities of a kind which can be added together to give other quantities 
of the same kind. The most obvious examples of such quantities are vectors. 

Let us consider a particular microscopic system in a particular state, which we 
label A: e.g., a photon with a particular energy, momentum, and polarization. 
We can represent this state as a particular vector, which we also label A, residing 
in some vector space, where the other elements of the space represent all of the 
other possible states of the system. Such a space is called a ket space (after Dirac). 
The state vector A is conventionally written 

|A>. (2.1) 

Suppose that state A is, in fact, the superposition of two different states, B and 
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C. This interrelation is represented in ket space by writing 

|A) = |B) + |C), (2.2) 

where |B) is the vector relating to the state B, etc. For instance, state |B) might 
represent a photon propagating in the z-direction, and plane polarized in the x- 
direction, and state |C) might represent a similar photon plane polarized in the 
y -direction. In this case, the sum of these two states represents a photon whose 
plane of polarization makes an angle of 45° with both the x- and y -directions (by 
analogy with classical physics). This latter state is represented by |B) + |C) in ket 
space. 

Suppose that we want to construct a state whose plane of polarization makes 
an arbitrary angle oc with the x-direction. We can do this via a suitably weighted 
superposition of states B and C. By analogy with classical physics, we require 
cos oc of state B, and sin oc of state C. This new state is represented by 

cos <x|B) + sin oc\C) (2.3) 

in ket space. Note that we cannot form a new state by superposing a state with 
itself. For instance, a photon polarized in the y -direction superposed with another 
photon polarized in the y -direction (with the same energy and momentum) gives 
the same photon. This implies that the ket vector 

c 1 |A) + c 2 |A) = (c 1 +c 2 )|A) (2.4) 

corresponds to the same state that |A) does. Thus, ket vectors differ from con- 
ventional vectors in that their magnitudes, or lengths, are physically irrelevant. 
All the states of the system are in one to one correspondence with all the possi- 
ble directions of vectors in the ket space, no distinction being made between the 
directions of the ket vectors |A) and — |A). There is, however, one caveat to the 
above statements. If ci + c 2 = then the superposition process yields nothing at 
all: i.e., no state. The absence of a state is represented by the null vector |0) in 
ket space. The null vector has the fairly obvious property that 

|A) + |0) = |A>, (2.5) 

for any vector |A). The fact that ket vectors pointing in the same direction repre- 
sent the same state relates ultimately to the quantization of matter: i.e., the fact 
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that it comes in irreducible packets called photons, electrons, atoms, etc. If we ob- 
serve a microscopic system then we either see a state (i.e., a photon, or an atom, 
or a molecule, etc.) or we see nothing — we can never see a fraction or a multiple 
of a state. In classical physics, if we observe a wave then the amplitude of the 
wave can take any value between zero and infinity. Thus, if we were to represent 
a classical wave by a vector, then the magnitude, or length, of the vector would 
correspond to the amplitude of the wave, and the direction would correspond to 
the frequency and wave-length, so that two vectors of different lengths pointing 
in the same direction would represent different wave states. 

We have seen, in Eq. (2.3), that any plane polarized state of a photon can 
be represented as a linear superposition of two orthogonal polarization states 
in which the weights are real numbers. Suppose that we want to construct a 
circularly polarized photon state. Well, we know from classical physics that a cir- 
cularly polarized wave is a superposition of two waves of equal amplitude, plane 
polarized in orthogonal directions, which are in phase quadrature. This suggests 
that a circularly polarized photon is the superposition of a photon polarized in 
the x-direction (state B) and a photon polarized in the y -direction (state C), with 
equal weights given to the two states, but with the proviso that state C is 90° 
out of phase with state B. By analogy with classical physics, we can use complex 
numbers to simultaneously represent the weighting and relative phase in a linear 
superposition. Thus, a circularly polarized photon is represented by 

|B) + i|C) (2.6) 

in ket space. A general elliptically polarized photon is represented by 

ci|B) + c 2 |C), (2.7) 

where ci and ci are complex numbers. We conclude that a ket space must be 
a complex vector space if it is to properly represent the mutual interrelations 
between the possible states of a microscopic system. 

Suppose that the ket |R) is expressible linearly in terms of the kets |A) and |B), 
so that 

|R) = Cl |A} + C2|B}. (2.8) 
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We say that |R) is dependent on |A) and |B). It follows that the state R can be 
regarded as a linear superposition of the states A and B. So, we can also say that 
state R is dependent on states A and B . In fact, any ket vector (or state) which 
is expressible linearly in terms of certain others is said to be dependent on them. 
Likewise, a set of ket vectors (or states) are termed independent if none of them 
are expressible linearly in terms of the others. 

The dimensionality of a conventional vector space is defined as the number 
of independent vectors contained in the space. Likewise, the dimensionality of 
a ket space is equivalent to the number of independent ket vectors it contains. 
Thus, the ket space which represents the possible polarization states of a photon 
propagating in the z-direction is two-dimensional (the two independent vectors 
correspond to photons plane polarized in the x- and y -directions, respectively). 
Some microscopic systems have a finite number of independent states (e.g., the 
spin states of an electron in a magnetic field) . If there are N independent states, 
then the possible states of the system are represented as an IN -dimensional ket 
space. Some microscopic systems have a denumerably infinite number of inde- 
pendent states (e.g., a particle in an infinitely deep, one-dimensional potential 
well) . The possible states of such a system are represented as a ket space whose 
dimensions are denumerably infinite. Such a space can be treated in more or less 
the same manner as a finite-dimensional space. Unfortunately, some microscopic 
systems have a nondenumerably infinite number of independent states (e.g., a 
free particle) . The possible states of such a system are represented as a ket space 
whose dimensions are nondenumerably infinite. This type of space requires a 
slightly different treatment to spaces of finite, or denumerably infinite, dimen- 
sions. 

In conclusion, the states of a general microscopic system can be represented as 
a complex vector space of (possibly) infinite dimensions. Such a space is termed 
a Hilbert space by mathematicians. 
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2.5 Bra space 

A snack machine inputs coins plus some code entered on a key pad, and (hope- 
fully) outputs a snack. It also does so in a deterministic manner: i.e., the same 
money plus the same code produces the same snack (or the same error message) 
time after time. Note that the input and output of the machine have completely 
different natures. We can imagine building a rather abstract snack machine which 
inputs ket vectors and outputs complex numbers in a deterministic fashion. Math- 
ematicians call such a machine afunctional. Imagine a general functional, labeled 
F, acting on a general ket vector, labeled A, and spitting out a general complex 
number 4>a- This process is represented mathematically by writing 

(F|(|A» = *a. (2.9) 

Let us narrow our focus to those functional which preserve the linear dependen- 
cies of the ket vectors upon which they operate. Not surprisingly, such functional 
are termed linear functionals. A general linear functional, labeled F, satisfies 

(F|(|A} + |B)) = (F|(|A)) + (F|(|B)), (2.10) 

where |A) and |B) are any two kets in a given ket space. 

Consider an N -dimensional ket space [i.e., a finite-dimensional, or denumer- 
ably infinite dimensional (i.e., N — > oo), space]. Let the |i) (where i runs from 1 
to N) represent N independent ket vectors in this space. A general ket vector can 
be written 1 

N 

|A> = ^Oi|i>, (2.11) 

i=1 

where the <x\ are an arbitrary set of complex numbers. The only way the func- 
tional F can satisfy Eq. (2.10) for all vectors in the ket space is if 

N 

(F|(|A))=^f\^> (2.12) 

i=1 

1 Actually, this is only strictly true for finite-dimensional spaces. Only a special subset of denumerably infinite 
dimensional spaces have this property (i.e., they are complete), but since a ket space must be complete if it is to 
represent the states of a microscopic system, we need only consider this special subset. 
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where the ft are a set of complex numbers relating to the functional. 




(2.13) 



N 



<F| = L f ^ 



(2.14) 



But, this implies that the set of all possible linear functionals acting on an IN - 
dimensional ket space is itself an N -dimensional vector space. This type of vector 
space is called a bra space (after Dirac), and its constituent vectors (which are 
actually functionals of the ket space) are called bra vectors. Note that bra vectors 
are quite different in nature to ket vectors (hence, these vectors are written in 
mirror image notation, (• • • | and !•••), so that they can never be confused). Bra 
space is an example of what mathematicians call a dual vector space (i.e., it is 
dual to the original ket space). There is a one to one correspondence between 
the elements of the ket space and those of the related bra space. So, for every 
element A of the ket space, there is a corresponding element, which it is also 
convenient to label A, in the bra space. That is, 



where DC stands for dual correspondence. 

There are an infinite number of ways of setting up the correspondence between 
vectors in a ket space and those in the related bra space. However, only one 
of these has any physical significance. For a general ket vector A, specified by 
Eq. (2.11), the corresponding bra vector is written 



where the ocl are the complex conjugates of the oti. (A| is termed the dual vector 
to |A). It follows, from the above, that the dual to c(A| is c*|A), where c is a 
complex number. More generally, 




(2.15) 



N 




(2.16) 



ci|A)+c 2 |B) f^cj(A| + c!(B 



(2.17) 
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Recall that a bra vector is a functional which acts on a general ket vector, and 
spits out a complex number. Consider the functional which is dual to the ket 
vector 

N 

|B} = ^|3 t |i) (2.18) 

acting on the ket vector |A). This operation is denoted (B|(|A)). Note, however, 
that we can omit the round brackets without causing any ambiguity, so the oper- 
ation can also be written (B||A). This expression can be further simplified to give 
(B|A). According to Eqs. (2.11), (2.12), (2.16), and (2.18), 

N 

(B|A) = J" 0{ on. (2.19) 

Mathematicians term (B|A) the inner product of a bra and a ket. 2 An inner prod- 
uct is (almost) analogous to a scalar product between a covariant and contravari- 
ant vector in some curvilinear space. It is easily demonstrated that 

(B|A) = (A|B)*. (2.20) 

Consider the special case where |B) — > |A). It follows from Eqs. (2.12) and (2.20) 
that (A|A) is a real number, and that 

(A|A) > 0. (2.21) 

The equality sign only holds if |A) is the null ket [i.e., if all of the oci are zero in 
Eq. (2.11)]. This property of bra and ket vectors is essential for the probabilistic 
interpretation of quantum mechanics, as will become apparent later. 

Two kets | A) and |B) are said to be orthogonal if 

(A|B) = 0, (2.22) 

which also implies that (B|A) = 0. 

Given a ket |A) which is not the null ket, we can define a normalized ket |A), 
where 

|A) = (im) IA>> (2 - 23) 



2 We can now appreciate the elegance of Dirac's notation. The combination of a bra and a ket yields a "bra(c)ket" 
(which is just a number). 
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with the property 

(A|A) = 1. (2.24) 

Here, y/(A\A) is known as the norm or "length" of |A), and is analogous to the 
length, or magnitude, of a conventional vector. Since |A) and c|A) represent 
the same physical state, it makes sense to require that all kets corresponding to 
physical states have unit norms. 

It is possible to define a dual bra space for a ket space of nondenumerably 
infinite dimensions in much the same manner as that described above. The main 
differences are that summations over discrete labels become integrations over 
continuous labels, Kronecker delta-functions become Dirac delta-functions, com- 
pleteness must be assumed (it cannot be proved), and the normalization conven- 
tion is somewhat different. More of this later. 

2.6 Operators 

We have seen that a functional is a machine which inputs a ket vector and spits 
out a complex number. Consider a somewhat different machine which inputs a 
ket vector and spits out another ket vector in a deterministic fashion. Mathemati- 
cians call such a machine an operator. We are only interested in operators which 
preserve the linear dependencies of the ket vectors upon which they act. Such 
operators are termed linear operators. Consider an operator labeled X. Suppose 
that when this operator acts on a general ket vector |A) it spits out a new ket 
vector which is denoted X|A). Operator X is linear provided that 

X(|A) + |B» =X|A)+X|B), (2.25) 

for all ket vectors |A) and |B), and 

X(c|A» = cX|A), (2.26) 

for all complex numbers c. Operators X and Y are said to be equal if 

X|A) = Y|A) (2.27) 
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for all kets in the ket space in question. Operator X is termed the null operator if 

X|A> = |0) (2.28) 

for all ket vectors in the space. Operators can be added together. Such addition 
is defined to obey a commutative and associate algebra: 

X + Y = Y + X, (2.29) 
X+(Y + Z) = (X + Y) + Z. (2.30) 

Operators can also be multiplied. The multiplication is associative: 

X(Y|A» = (XY)|A> =XY|A), (2.31) 
X(YZ) = (XY)Z = XYZ. (2.32) 

However, in general, it is noncommutative: 

XY^YX. (2.33) 



So far, we have only considered linear operators acting on ket vectors. We can 
also give a meaning to their operating on bra vectors. Consider the inner product 
of a general bra (B| with the ket X|A). This product is a number which depends 
linearly on |A). Thus, it may be considered to be the inner product of |A) with 
some bra. This bra depends linearly on (B|, so we may look on it as the result of 
some linear operator applied to (B|. This operator is uniquely determined by the 
original operator X, so we might as well call it the same operator acting on |B). A 
suitable notation to use for the resulting bra when X operates on (B| is (B|X. The 
equation which defines this vector is 

((B|X)|A) = (B|(X|A» (2.34) 

for any |A) and (B|. The triple product of (B|, X, and |A) can be written (B|X|A) 
without ambiguity, provided we adopt the convention that the bra vector always 
goes on the left, the operator in the middle, and the ket vector on the right. 

Consider the dual bra to X|A). This bra depends antilinearly on |A) and must 
therefore depend linearly on (A|. Thus, it may be regarded as the result of some 
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linear operator applied to (A|. This operator is termed the adjoint of X, and is 
denoted X^. Thus, 

X|A) (A|X f . (2.35) 

It is readily demonstrated that 

(B|X f |A) = (A|X|B)*, (2.36) 

plus 

(XY) f = Y^Xl (2.37) 

It is also easily seen that the adjoint of the adjoint of a linear operator is equiva- 
lent to the original operator. A Hermitian operator £, has the special property that 
it is its own adjoint: i.e., 

I = £, f . (2.38) 



2.7 The outer product 

So far we have formed the following products: (B|A), X|A), (A|X, X Y, (B|X|A). 
Are there any other products we are allowed to form? How about 

|B)(A|? (2.39) 

This clearly depends linearly on the ket |A) and the bra |B). Suppose that we 
right-multiply the above product by the general ket |C). We obtain 

|B)(A|C) = (A|C)|B), (2.40) 

since (A|C) is just a number. Thus, |B)(A| acting on a general ket |C) yields 
another ket. Clearly, the product |B)(A| is a linear operator. This operator also 
acts on bras, as is easily demonstrated by left-multiplying the expression (2.39) 
by a general bra (C|. It is also easily demonstrated that 

(|B)(A|)+ = |A)(B|. (2.41) 

Mathematicians term the operator |B)(A| the outer product of |B) and (A|. The 
outer product should not be confused with the inner product, (A|B), which is just 
a number. 
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2.8 Eigenvalues and eigenvectors 

In general, the ket X|A) is not a constant multiple of |A). However, there are 
some special kets known as the eigenkets of operator X. These are denoted 

|x'),|x"),|x'">..., (2.42) 

and have the property 

X|x') = xV) , X|x") = x"\x") . . . , (2.43) 

where x', x", . . . are numbers called eigenvalues. Clearly, applying X to one of its 
eigenkets yields the same eigenket multiplied by the associated eigenvalue. 

Consider the eigenkets and eigenvalues of a Hermitian operator £,. These are 
denoted 

SIO = W), (2-44) 

where |£/) is the eigenket associated with the eigenvalue £/. Three important 
results are readily deduced: 

(i) The eigenvalues are all real numbers, and the eigenkets corresponding to 
different eigenvalues are orthogonal. Since £, is Hermitian, the dual equation to 
Eq. (2.44) (for the eigenvalue £,") reads 

= £"*<£"!. (2-45) 

If we left-multiply Eq. (2.44) by (E,"\, right-multiply the above equation by |£/), 
and take the difference, we obtain 

=0 . (2.46) 

Suppose that the eigenvalues £/ and E," are the same. It follows from the above 
that 

£/ = £/*, (2.47) 

where we have used the fact that |£/} is not the null ket. This proves that the 
eigenvalues are real numbers. Suppose that the eigenvalues £/ and E," are differ- 
ent. It follows that 

= 0, (2.48) 
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which demonstrates that eigenkets corresponding to different eigenvalues are 
orthogonal. 

(ii) The eigenvalues associated with eigenkets are the same as the eigenvalues 
associated with eigenbras. An eigenbra of £, corresponding to an eigenvalue £/ is 
defined 

(£/|£, = (£/|£/. (2.49) 

(Hi) The dual of any eigenket is an eigenbra belonging to the same eigenvalue, 
and conversely. 

2.9 Observables 

We have developed a mathematical formalism which comprises three types of 
objects — bras, kets, and linear operators. We have already seen that kets can be 
used to represent the possible states of a microscopic system. However, there is 
a one to one correspondence between the elements of a ket space and its dual 
bra space, so we must conclude that bras could just as well be used to repre- 
sent the states of a microscopic system. What about the dynamical variables of 
the system (e.g., its position, momentum, energy, spin, etc.)? How can these be 
represented in our formalism? Well, the only objects we have left over are oper- 
ators. We, therefore, assume that the dynamical variables of a microscopic system 
are represented as linear operators acting on the bras and kets which correspond to 
the various possible states of the system. Note that the operators have to be linear, 
otherwise they would, in general, spit out bras/kets pointing in different direc- 
tions when fed bras/kets pointing in the same direction but differing in length. 
Since the lengths of bras and kets have no physical significance, it is reasonable 
to suppose that non-linear operators are also without physical significance. 

We have seen that if we observe the polarization state of a photon, by placing 
a polaroid film in its path, the result is to cause the photon to jump into a state 
of polarization parallel or perpendicular to the optic axis of the film. The former 
state is absorbed, and the latter state is transmitted (which is how we tell them 
apart). In general, we cannot predict into which state a given photon will jump 
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(except in a statistical sense) . However, we do know that if the photon is initially 
polarized parallel to the optic axis then it will definitely be absorbed, and if it is 
initially polarized perpendicular to the axis then it will definitely be transmitted. 
We also known that after passing though the film a photon must be in a state of 
polarization perpendicular to the optic axis (otherwise it would not have been 
transmitted). We can make a second observation of the polarization state of 
such a photon by placing an identical polaroid film (with the same orientation of 
the optic axis) immediately behind the first film. It is clear that the photon will 
definitely be transmitted through the second film. 

There is nothing special about the polarization states of a photon. So, more 
generally, we can say that when a dynamical variable of a microscopic system 
is measured the system is caused to jump into one of a number of independent 
states (note that the perpendicular and parallel polarization states of our photon 
are linearly independent) . In general, each of these final states is associated with 
a different result of the measurement: i.e., a different value of the dynamical 
variable. Note that the result of the measurement must be a real number (there 
are no measurement machines which output complex numbers). Finally, if an 
observation is made, and the system is found to be a one particular final state, 
with one particular value for the dynamical variable, then a second observation, 
made immediately after the first one, will definitely find the system in the same 
state, and yield the same value for the dynamical variable. 

How can we represent all of these facts in our mathematical formalism? Well, 
by a fairly non-obvious leap of intuition, we are going to assert that a measure- 
ment of a dynamical variable corresponding to an operator X in ket space causes 
the system to jump into a state corresponding to one of the eigenkets of X. Not 
surprisingly, such a state is termed an eigenstate. Furthermore, the result of the 
measurement is the eigenvalue associated with the eigenket into which the system 
jumps. The fact that the result of the measurement must be a real number implies 
that dynamical variables can only be represented byHermitian operators (since only 
Hermitian operators are guaranteed to have real eigenvalues). The fact that the 
eigenkets of a Hermitian operator corresponding to different eigenvalues (i.e., dif- 
ferent results of the measurement) are orthogonal is in accordance with our ear- 
lier requirement that the states into which the system jumps should be mutually 
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independent. We can conclude that the result of a measurement of a dynamical 
variable represented by a Hermitian operator £, must be one of the eigenvalues of 
£,. Conversely every eigenvalue of £, is a possible result of a measurement made 
on the corresponding dynamical variable. This gives us the physical significance 
of the eigenvalues. (From now on, the distinction between a state and its rep- 
resentative ket vector, and a dynamical variable and its representative operator, 
will be dropped, for the sake of simplicity.) 

It is reasonable to suppose that if a certain dynamical variable £, is measured 
with the system in a particular state, then the states into which the system may 
jump on account of the measurement are such that the original state is dependent 
on them. This fairly innocuous statement has two very important corollaries. 
First, immediately after an observation whose result is a particular eigenvalue £/, 
the system is left in the associated eigenstate. However, this eigenstate is orthog- 
onal to (i.e., independent of) any other eigenstate corresponding to a different 
eigenvalue. It follows that a second measurement made immediately after the 
first one must leave the system in an eigenstate corresponding to the eigenvalue 
£/. In other words, the second measurement is bound to give the same result as 
the first. Furthermore, if the system is in an eigenstate of corresponding to an 
eigenvalue E,', then a measurement of £, is bound to give the result £/. This follows 
because the system cannot jump into an eigenstate corresponding to a different 
eigenvalue of £,, since such a state is not dependent on the original state. Second, 
it stands to reason that a measurement of £, must always yield some result. It fol- 
lows that no matter what the initial state of the system, it must always be able to 
jump into one of the eigenstates of £,. In other words, a general ket must always 
be dependent on the eigenkets of £,. This can only be the case if the eigenkets 
form a complete set (i.e., they span ket space). Thus, in order for a Hermitian oper- 
ator £, to be observable its eigenkets must form a complete set. A Hermitian operator 
which satisfies this condition is termed an observable. Conversely, any observable 
quantity must be a Hermitian operator with a complete set of eigenstates. 
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2.10 Measurements 

We have seen that a measurement of some observable £, of a microscopic system 
causes the system to jump into one of the eigenstates of £,. The result of the 
measurement is the associated eigenvalue (or some function of this quantity) . It 
is impossible to determine into which eigenstate a given system will jump, but it is 
possible to predict the probability of such a transition. So, what is the probability 
that a system in some initial state |A) makes a transition to an eigenstate If,') of an 
observable £,, as a result of a measurement made on the system? Let us start with 
the simplest case. If the system is initially in an eigenstate |£/) then the transition 
probability to a eigenstate |£/') corresponding to a different eigenvalue is zero, 
and the transition probability to the same eigenstate |£/) is unity. It is convenient 
to normalize our eigenkets such that they all have unit norms. It follows from the 
orthogonality property of the eigenkets that 



where is unity if £/ = £,", and zero otherwise. For the moment, we are 
assuming that the eigenvalues of £, are all different. 

Note that the probability of a transition from an initial eigenstate |£/} to a fi- 
nal eigenstate \E,") is the same as the value of the inner product Can we 
use this correspondence to obtain a general rule for calculating transition prob- 
abilities? Well, suppose that the system is initially in a state |A) which is not an 
eigenstate of £,. Can we identify the transition probability to a final eigenstate 
|£/) with the inner product (A|£/)? The straight answer is "no", since (A|£/) is, in 
general, a complex number, and complex probabilities do not make much sense. 
Let us try again. How about if we identify the transition probability with the mod- 
ulus squared of the inner product, |(A|£/)| 2 ? This quantity is definitely a positive 
number (so it could be a probability). This guess also gives the right answer for 
the transition probabilities between eigenstates. In fact, it is the correct guess. 

Since the eigenstates of an observable £, form a complete set, we can express 
any given state | A) as a linear combination of them. It is easily demonstrated that 



(2.50) 




(2.51) 
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<A| = 2jA|£/)(£/|, (2.52) 

(A|A) = ^(A|£, , }(^IA) = LK a IOI 2 > (2-53) 

where the summation is over all the different eigenvalues of £,, and use has been 
made of Eq. (2.20), and the fact that the eigenstates are mutually orthogonal. 
Note that all of the above results follow from the extremely useful (and easily 
proved) result 

2jO(£/l = 1> (2-54) 

I 1 

where 1 denotes the identity operator. The relative probability of a transition to 
an eigenstate |£/), which is equivalent to the relative probability of a measure- 
ment of £, yielding the result £/, is 

P(£/)oc|(A|£/)l 2 . (2.55) 
The absolute probability is clearly 

Ptt') = J^L = S (2.56) 



(A|A> ■ 

If the ket | A) is normalized such that its norm is unity then this probability simply 
reduces to 

P(0 = KMOI 2 - (2-57) 



2.11 Expectation values 

Consider an ensemble of microscopic systems prepared in the same initial state 
|A). Suppose a measurement of the observable £, is made on each system. We 
know that each measurement yields the value £/ with probability P(£/). What is 
the mean value of the measurement? This quantity which is generally referred 
to as the expectation value of £,, is given by 

<£,) = ^^P(0=L^I(AI0I 2 
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= Y_ */<A|0(£'IA) = 2jA|£J£/)(£/|A), (2.58) 

which reduces to 

(£,) = (A|£JA) (2.59) 

with the aid of Eq. (2.54). 

Consider the identity operator, 1. All states are eigenstates of this operator 
with the eigenvalue unity. Thus, the expectation value of this operator is always 
unity: i.e., 

(A|1|A) = (A|A) = 1, (2.60) 

for all |A). Note that it is only possible to normalize a given ket |A) such that 
Eq. (2.60) is satisfied because of the more general property (2.21) of the norm. 
This property depends on the particular correspondence (2.16), that we adopted 
earlier, between the elements of a ket space and those of its dual bra space. 



2.12 Degeneracy 

Suppose that two different eigenstates \E,' a ) and |£,£) of £, correspond to the same 
eigenvalue £/. These are termed degenerate eigenstates. Degenerate eigenstates 
are necessarily orthogonal to any eigenstates corresponding to different eigen- 
values, but, in general, they are not orthogonal to each other (i.e., the proof of 
orthogonality given in Sect. 2.8 does not work in this case). This is unfortunate, 
since much of the previous formalism depends crucially on the mutual orthogo- 
nality of the different eigenstates of an observable. Note, however, that any linear 
combination of \E,' a ) and )£,{,} is also an eigenstate corresponding to the eigenvalue 
£/. It follows that we can always construct two mutually orthogonal degenerate 
eigenstates. For instance, 



ft) = lO> (2-61) 



\1' 2 ) = |£, 9 ffil^lffi - (2.62) 
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This result is easily generalized to the case of more than two degenerate eigen- 
states. We conclude that it is always possible to construct a complete set of mu- 
tually orthogonal eigenstates for any given observable. 

2.13 Compatible observables 

Suppose that we wish to simultaneously measure two observables, £, and n, of 
a microscopic system? Let us assume that we possess an apparatus which is ca- 
pable of measuring £,, and another which can measure n. For instance, the two 
observables in question might be the projection in the x- and z-directions of the 
spin angular momentum of a spin one-half particle. These could be measured us- 
ing appropriate Stern-Gerlach apparatuses (see Sakurai, Sect. 1.1). Suppose that 
we make a measurement of £,, and the system is consequently thrown into one 
of the eigenstates of £,, |£/}, with eigenvalue £/. What happens if we now make 
a measurement of u? Well, suppose that the eigenstate |£/) is also an eigenstate 
of rj, with eigenvalue u/. In this case, a measurement of r) will definitely give the 
result u/. A second measurement of £, will definitely give the result £/, and so on. 
In this sense, we can say that the observables £, and rj simultaneously have the 
values £/ and u/, respectively. Clearly, if all eigenstates of £, are also eigenstates 
of u then it is always possible to make a simultaneous measurement of £, and u. 
Such observables are termed compatible. 

Suppose, however, that the eigenstates of £, are not eigenstates of r|. Is it 
still possible to measure both observables simultaneously? Let us again make an 
observation of £, which throws the system into an eigenstate |£/), with eigenvalue 
£/. We can now make a second observation to determine rj. This will throw 
the system into one of the (many) eigenstates of r| which depend on |£/}. In 
principle, each of these eigenstates is associated with a different result of the 
measurement. Suppose that the system is thrown into an eigenstate In/}, with 
the eigenvalue u/. Another measurement of £, will throw the system into one 
of the (many) eigenstates of £, which depend on In/). Each eigenstate is again 
associated with a different possible result of the measurement. It is clear that if 
the observables £, and r) do not possess simultaneous eigenstates then if the value 
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of £, is known (i.e., the system is in an eigenstate of £,) then the value of u is 
uncertain (i.e., the system is not in an eigenstate of u), and vice versa. We say 
that the two observables are incompatible. 

We have seen that the condition for two observables £, and rj to be simultane- 
ously measurable is that they should possess simultaneous eigenstates (i.e., every 
eigenstate of £, should also be an eigenstate of rj). Suppose that this is the case. 
Let a general eigenstate of £,, with eigenvalue £/, also be an eigenstate of rj, with 
eigenvalue n/. It is convenient to denote this simultaneous eigenstate If/u/)- We 
have 

£,|£, , n / ) = £/|£/r,'}, (2.63) 
TifcV) = tlUY>. (2-64) 

We can left-multiply the first equation by n, and the second equation by £,, and 
then take the difference. The result is 

^u-u^VHIO} (2.65) 

for each simultaneous eigenstate. Recall that the eigenstates of an observable 
must form a complete set. It follows that the simultaneous eigenstates of two 
observables must also form a complete set. Thus, the above equation implies that 

(£,r|-r|£J|A) = |0), (2.66) 

where | A) is a general ket. The only way that this can be true is if 

£ti = u£,. (2.67) 

Thus, the condition for two observables £, and n to be simultaneously measurable is 
that they should commute. 

2.14 The uncertainty relation 

We have seen that if £, and n are two noncommuting observables, then a deter- 
mination of the value of £, leaves the value of n uncertain, and vice versa. It is 
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possible to quantify this uncertainty. For a general observable £,, we can define a 
Hermitian operator 

A£, = £,-(£,), (2.68) 

where the expectation value is taken over the particular physical state under con- 
sideration. It is obvious that the expectation value of A£, is zero. The expectation 
value of (A£,) 2 = A£, A£, is termed the variance of £,, and is, in general, non-zero. 
In fact, it is easily demonstrated that 

<(A£J 2 ) = (£, 2 ) - <£,) 2 . (2.69) 

The variance of £, is a measure of the uncertainty in the value of £, for the particu- 
lar state in question (i.e., it is a measure of the width of the distribution of likely 
values of £, about the expectation value) . If the variance is zero then there is no 
uncertainty, and a measurement of £, is bound to give the expectation value, (£,). 

Consider the Schwarz inequality 

(A|A)(B|B) > |(A|B}| 2 , (2.70) 

which is analogous to 

|a| 2 |b| 2 > |a-b| 2 (2.71) 
in Euclidian space. This inequality can be proved by noting that 

«A| + c*<B|)(|A)+c|B)) >0, (2.72) 

where c is any complex number. If c takes the special value — (B|A)/(B|B) then 
the above inequality reduces to 

(A|A)(B|B)-|<A|B}| 2 >0, (2.73) 

which is the same as the Schwarz inequality. 

Let us substitute 

|A> = A£J), (2.74) 
IB) = Aril), (2.75) 
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into the Schwarz inequality, where the blank ket | } stands for any general ket. 
We find 

<(A0 2 )<(Au) 2 )>|<A£,Ar,)| 2 , (2.76) 

where use has been made of the fact that A£, and An. are Hermitian operators. 
Note that 

A£,Arj = 1 [A£,, An] + I{A£,, An}, (2.77) 

where the commutator, [A£,, An], and the anti-commutator, {A£,, An.}, are defined 

[A£,,Au] ee A£, An — An A£,, (2.78) 

{A£,,Au} = A£, An + An A£,. (2.79) 

The commutator is clearly anti-Hermitian, 

([A£,, Au]) f = (A£, An - An A^ = An A£, - A£, An = - [A£,, An] , (2.80) 

whereas the anti-commutator is obviously Hermitian. Now, it is easily demon- 
strated that the expectation value of a Hermitian operator is a real number, 
whereas the expectation value of an anti-Hermitian operator is a pure imaginary 
number. It is clear that the right hand side of 

(A£, An) = 1<[A£, An]) + ~({A£,, An}}, (2.81) 

consists of the sum of a purely real and a purely imaginary number. Taking the 
modulus squared of both sides gives 

|<A£,Au)| 2 = ] 4 \([^])\ 2 + 1|({A£,, An})| 2 , (2.82) 

where use has been made of (AE) = 0, etc. The final term in the above expression 
is positive definite, so we can write 

((A0 2 )<(An) 2 )>l|(^,n])| 2 , (2.83) 

where use has been made of Eq. (2.76). The above expression is termed the 
uncertainty relation. According to this relation, an exact knowledge of the value 
of £, implies no knowledge whatsoever of the value of n., and vice versa. The one 
exception to this rule is when £, and rj commute, in which case exact knowledge 
of £, does not necessarily imply no knowledge of rj. 
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2.15 Continuous spectra 

Up to now, we have studiously avoided dealing with observables possessing eigen- 
values which lie in a continuous range, rather than having discrete values. The 
reason for this is because continuous eigenvalues imply a ket space of nonde- 
numerably infinite dimension. Unfortunately, continuous eigenvalues are un- 
avoidable in quantum mechanics. In fact, the most important observables of all, 
namely position and momentum, generally have continuous eigenvalues. Fortu- 
nately, many of the results we obtained previously for a finite-dimensional ket 
space with discrete eigenvalues can be generalized to ket spaces of nondenumer- 
ably infinite dimensions. 

Suppose that £, is an observable with continuous eigenvalues. We can still 
write the eigenvalue equation as 

£JO = £/lO- (2-84) 

But, £/ can now take a continuous range of values. Let us assume, for the sake of 
simplicity, that £/ can take any value. The orthogonality condition (2.50) gener- 
alizes to 

<£/|£,"} = 6(£/ -£,"), (2.85) 

where 6(x) denotes the famous Dirac delta-function. Note that there are clearly a 
nondenumerably infinite number of mutually orthogonal eigenstates of £,. Hence, 
the dimensionality of ket space is nondenumerably infinite. Note, also, that eigen- 
states corresponding to a continuous range of eigenvalues cannot be normalized 
so that they have unit norms. In fact, these eigenstates have infinite norms: i.e., 
they are infinitely long. This is the major difference between eigenstates in a 
finite-dimensional and an infinite-dimensional ket space. The extremely useful 
relation (2.54) generalizes to 

d£/|£/)<£/| = 1. (2.86) 

Note that a summation over discrete eigenvalues goes over into an integral over 
a continuous range of eigenvalues. The eigenstates |£/) must form a complete set 
if £, is to be an observable. It follows that any general ket can be expanded in 
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terms of the |£/). In fact, the expansions (2.51)-(2.53) generalize to 



|A> 



(M 



(A|A) 



d£/|£/)(£/|A>, 



d£/ (MO^'I. 
d£,' (A|£/)(*/IA) 



d£/|(A|£, 



/\|2 



(2.87) 



(2.88) 



(2.89) 



These results also follow simply from Eq. (2.86). We have seen that it is not possi- 
ble to normalize the eigenstates |£/) such that they have unit norms. Fortunately 
this convenient normalization is still possible for a general state vector. In fact, 
according to Eq. (2.89), the normalization condition can be written 



(A|A> 



d£/|(A|£/)l 2 = 1 



(2.90) 



We have now studied observables whose eigenvalues can take a discrete num- 
ber of values as well as those whose eigenvalues can take any value. There are 
number of other cases we could look at. For instance, observables whose eigen- 
values can only take a finite range of values, or observables whose eigenvalues 
take on a finite range of values plus a set of discrete values. Both of these cases 
can be dealt with using a fairly straight-forward generalization of the previous 
analysis (see Dirac, Cha. II and III). 
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3.1 Introduction 



So far, we have considered general dynamical variables represented by general 
linear operators acting in ket space. However, in classical mechanics the most 
important dynamical variables are those involving position and momentum. Let 
us investigate the role of such variables in quantum mechanics. 

In classical mechanics, the position q and momentum p of some component 
of a dynamical system are represented as real numbers which, by definition, com- 
mute. In quantum mechanics, these quantities are represented as noncommuting 
linear Hermitian operators acting in a ket space which represents all of the pos- 
sible states of the system. Our first task is to discover a quantum mechanical 
replacement for the classical result q p — p q = 0. Do the position and momen- 
tum operators commute? If not, what is the value of q p — p q? 



3.2 Poisson brackets 



Consider a dynamic system whose state at a particular time t is fully specified 
by N independent classical coordinates qt (where i runs from 1 to N). Associ- 
ated with each generalized coordinate q t is a classical canonical momentum p t . 
For instance, a Cartesian coordinate has an associated linear momentum, an an- 
gular coordinate has an associated angular momentum, etc. As is well-known, 
the behaviour of a classical system can be specified in terms of Lagrangian or 
Hamiltonian dynamics. For instance, in Hamiltonian dynamics, 

dCIi 3H (3.1) 



dt 3pi ' 
dpt _3H 
dt 3qt 



(3.2) 



where the function H(qt,pi,t) is the energy of the system at time t expressed 
in terms of the classical coordinates and canonical momenta. This function is 
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usually referred to as the Hamiltonian of the system. 

We are interested in finding some construct of classical dynamics which con- 
sists of products of dynamical variables. If such a construct exists we hope to 
generalize it somehow to obtain a rule describing how dynamical variables com- 
mute with one another in quantum mechanics. There is, indeed, one well-known 
construct in classical dynamics which involves products of dynamical variables. 
The Poisson bracket of two dynamical variables u and v is defined 

l — \3qi3pi 3pi3qi ; 



where u and v are regarded as functions of the coordinates and momenta q t and 
Pi. It is easily demonstrated that 

[qt,qj = 0, (3.4) 
[pt,Pj] = 0, (3.5) 
[qt,pj = 6tj. (3.6) 

The time evolution of a dynamical variable can also be written in terms of a 
Poisson bracket by noting that 

du y~ / 3u dqi 9u &pi\ 

~ 2— \ + 



dt j—f \3qi dt 3pi dt 

N 

= L 



N ' 9u 9H 3u 9H 



i=1 



= [u,H], (3.7) 

where use has been made of Hamilton's equations. 

Can we construct a quantum mechanical Poisson bracket in which u and v are 
noncommuting operators, instead of functions? Well, the main properties of the 
classical Poisson bracket are as follows: 

[u,v] = -[v,u], (3.8) 
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[u,c] = 0, (3.9) 

[ui+u 2 ,v] = [U],V] + [u 2 ,v], (3.10) 

[u,vi+v 2 ] = [u,vi] + [u,v 2 ] (3.11) 

[uiu 2 ,v] = [ui,v]u 2 + ui[u 2 ,v], (3.12) 

[u,V]V 2 ] = [u,vi]v 2 + vi[u,v 2 ], (3.13) 

and 

[u, [v,w]] + [v, [w,u]] + [w, [u,v]] = 0. (3.14) 

The last relation is known as the Jacobi identity. In the above, u, v, w, etc., 
represent dynamical variables, and c represents a number. Can we find some 
combination of noncommuting operators u and v, etc., which satisfies all of the 
above relations? 

Well, we can evaluate the Poisson bracket [ui u 2 , vi v 2 ] in two different ways, 
since we can use either of the formulae (3.12) or (3.13) first. Thus, 

[uiu 2 ,V]V 2 ] = [ui,V] v 2 ]u 2 + u 1 [u 2 ,v 1 v 2 ] (3.15) 

= {[ui,vi]v 2 + v 1 [ui,v 2 ]}u 2 + Ui {[U 2 ,V]]V 2 + V![U 2 ,V 2 ]} 

= [Ui , V]]V 2 U 2 + V] [U] , V 2 ]U 2 + U] [U 2 , V]]V 2 + U] V] [u 2 , v 2 ] , 

and 

[uiu 2 ,viv 2 ] = [ui u 2 ,V]]v 2 + vi[ui u 2 ,v 2 ] (3.16) 

= [u 1 ,v 1 ]u 2 v 2 + u 1 [u 2 ,v 1 ]v 2 + v 1 [ui,v 2 ]u 2 +Vi U][U 2 ,V 2 ]. 

Note that the order of the various factors has been preserved, since they now 
represent noncommuting operators. Equating the above two results yields 

[u 1 ,v 1 ](u 2 v 2 - v 2 u 2 ) = (ui V] -V] ui)[u 2 ,v 2 ]. (3.17) 

Since this relation must hold for ui and vi quite independent of u 2 and v 2 , it 
follows that 

ui vi — vi ui = iPi[ui,vi], (3.18) 
u 2 v 2 — v 2 u 2 = ifi[u 2 ,v 2 ], (3.19) 
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where h does not depend on ui, vi, uz, vz, and also commutes with (ui vi — vi ui ). 
Since ui, etc., are quite general operators, it follows that Pi is just a number. We 
want the quantum mechanical Poisson bracket of two Hermitian operators to 
be an Hermitian operator itself, since the classical Poisson bracket of two real 
dynamical variables is real. This requirement is satisfied if h is a real number. 
Thus, the quantum mechanical Poisson bracket of two dynamical variables u and 
v is given by 

uv — vu 

u,v = — , (3.20) 

1 Ft 

where h is a new universal constant of nature. Quantum mechanics agrees with 
experiments provided that h takes the value h/2n, where 

h= 6.6261 x 10 34 Js (3.21) 

is Planck's constant. Somewhat confusingly, the notation [u,v] is convention- 
ally reserved for the commutator uv — vu in quantum mechanics. We will use 
[u, v] quantum to denote the quantum Poisson bracket. Thus, 

[u,v] 
ill 

It is easily demonstrated that the quantum mechanical Poisson bracket, as defined 
above, satisfies all of the relations (3.8)-(3.14). 



[U-> v] quantum — . T _ ■ (3.22) 



The strong analogy we have found between the classical Poisson bracket, 
defined in Eq. (3.3), and the quantum mechanical Poisson bracket, defined in 
Eq. (3.22), leads us to make the assumption that the quantum mechanical bracket 
has the same value as the corresponding classical bracket, at least for the simplest 
cases. In other words, we are assuming that Eqs. (3.4)-(3.6) hold for quantum 
mechanical as well as classical Poisson brackets. This argument yields the funda- 
mental commutation relations 

[qt,qj] = 0, (3.23) 
[pt,Pj] = 0, (3.24) 
[q i)Pj ] = iTiSij. (3.25) 

These results provide us with the basis for calculating commutation relations be- 
tween general dynamical variables. For instance, if two dynamical variables, £, 
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and rj, can both be written as a power series in the c\\ and pi, then repeated 
application of Eqs. (3.8)-(3.13) allows [£,,r|] to be expressed in terms of the fun- 
damental commutation relations (3.23)-(3.25). 

Equations (3.23)-(3.25) provide the foundation for the analogy between quan- 
tum mechanics and classical mechanics. Note that the classical result (that every- 
thing commutes) is obtained in the limit Pi — > 0. Thus, classical mechanics can be 
regarded as the limiting case of quantum mechanics when hgoes to zero. In classi- 
cal mechanics, each pair of generalized coordinate and its conjugate momentum, 
qi and pi, correspond to a different classical degree of freedom of the system. It is 
clear from Eqs. (3.23)-(3.25) that in quantum mechanics the dynamical variables 
corresponding to different degrees of freedom all commute. It is only those variables 
corresponding to the same degree of freedom which may fail to commute. 

3.3 Wave-functions 

Consider a simple system with one classical degree of freedom, which corre- 
sponds to the Cartesian coordinate x. Suppose that x is free to take any value 
(e.g., x could be the position of a free particle). The classical dynamical vari- 
able x is represented in quantum mechanics as a linear Hermitian operator which 
is also called x. Moreover, the operator x possesses eigenvalues x' lying in the 
continuous range — oo < x' < +oo (since the eigenvalues correspond to all the 
possible results of a measurement of x). We can span ket space using the suit- 
ably normalized eigenkets of x. An eigenket corresponding to the eigenvalue x' 
is denoted |x'). Moreover, [see Eq. (2.85)] 

(x'|x") = 5(x'-x"). (3.26) 
The eigenkets satisfy the extremely useful relation [see Eq. (2.86)] 

dx'lx'Xx'l = 1. (3.27) 

J — oo 

This formula expresses the fact that the eigenkets are complete, mutually orthog- 
onal, and suitably normalized. 
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A state ket |A) (which represents a general state A of the system) can be 
expressed as a linear superposition of the eigenkets of the position operator using 
Eq. (3.27). Thus, 



r+oo 



|A> 



dx'(x'IA)lx') (3.28) 



The quantity (x'|A) is a complex function of the position eigenvalue x'. We can 
write 

(x'|A) =ip A (x / ). (3.29) 

Here, i^aIx') is the famous wave-function of quantum mechanics. Note that state 
A is completely specified by its wave-function iPa(x') [since the wave-function 
can be used to reconstruct the state ket |A) using Eq. (3.28)]. It is clear that 
the wave-function of state A is simply the collection of the weights of the cor- 
responding state ket |A), when it is expanded in terms of the eigenkets of the 
position operator. Recall, from Sect. 2.10, that the probability of a measurement 
of a dynamical variable £, yielding the result £/ when the system is in state A is 
given by |(£/|A)| 2 , assuming that the eigenvalues of £, are discrete. This result is 
easily generalized to dynamical variables possessing continuous eigenvalues. In 
fact, the probability of a measurement of x yielding a result lying in the range 
x' to x' + dx' when the system is in a state |A) is |(x'|A)| 2 dx'. In other words, 
the probability of a measurement of position yielding a result in the range x' to 
x' + dx' when the wave-function of the system is iPa(x') is 

P(x', dx') = |iMx')l 2 dx'. (3.30) 

This formula is only valid if the state ket |A) is properly normalized: i.e., if 
(A|A) = 1 . The corresponding normalization for the wave-function is 

-oo 

N> A (x')l 2 dx' = 1. (3.31) 

-oo 

Consider a second state B represented by a state ket |B) and a wave-function 
iPb(x'). The inner product (B|A) can be written 



B|A) 



dx^Blx'Xx'IA) 



i^(x')al) A (x')dx', (3.32) 
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where use has been made of Eqs. (3.27) and (3.29). Thus, the inner product of 
two states is related to the overlap integral of their wave-functions. 

Consider a general function f(x) of the observable x [e.g., f(x) = x 2 ]. If |B) = 
f (x)|A) then it follows that 



iMx') = (x'|f(x) 



dx"a|> A (x")|x" x 

-oo 



dx"f{x")^ A {x")(x'\x"), (3.33) 



giving 



ij)B(x') =f(x , )ip A (x / ), (3.34) 

where use has been made of Eq. (3.26). Here, f (x') is the same function of the 
position eigenvalue x' that f (x) is of the position operator x: i.e., if f (x) = x 2 then 
f (x') = x /2 . It follows, from the above result, that a general state ket |A) can be 
written 

|A> =i|>a(x)), (3.35) 

where iPa(x) is the same function of the operator x that the wave-function "4>a(x') 
is of the position eigenvalue x' , and the ket ) has the wave-function "4>(x') = 1. 
The ket ) is termed the standard ket. The dual of the standard ket is termed the 
standard bra, and is denoted (. It is easily seen that 

(i|) A (x) f^iMx)}. (3.36) 

Note, finally, that ip A (x)) is often shortened to t\*a), leaving the dependence on 
the position operator x tacitly understood. 



3.4 Schrodinger's representation - 1 

Consider the simple system described in the previous section. A general state ket 
can be written ip(x)), where ip(x) is a general function of the position operator x, 
and "4>(x') is the associated wave-function. Consider the ket whose wave-function 
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is dty[x')/dx'. This ket is denoted dip/dx). The new ket is clearly a linear func- 
tion of the original ket, so we can think of it as the result of some linear operator 
acting on ip). Let us denote this operator d/dx. It follows that 



dx^ / dx' 



(3.37) 



Any linear operator which acts on ket vectors can also act on bra vectors. 
Consider d/dx acting on a general bra (c()(x). According to Eq. (2.34), the bra 
(fy d/dx satisfies 

(<*S *>=<*(£»)• (3 - 38) 

Making use of Eqs. (3.27) and (3.29), we can write 



(4) — |x') dx'iHx') 
dx 



,+ °° i\ j ,&Mx') 

cp(x J dx 

oo 



dx' 



(3.39) 



The right-hand side can be transformed via integration by parts to give 



r+oo 



(ct>— |x') dx>(x') 
dx 



' + °° dc^x') ,,,,,, 
, , dx i|)(x ), 

rv-i Q-X 



(3.40) 



assuming that the contributions from the limits of integration vanish. It follows 
that 

d , d(j)(x') 
(^)=-^> (3.41) 



(3.42) 



which implies 

d^ 

dx dx 

The neglect of contributions from the limits of integration in Eq. (3.40) is rea- 
sonable because physical wave-functions are square-integrable [see Eq. (3.31)]. 
Note that 

dip. dc ,dip* 



A^) _■ dl K ( DC ) ( d ^* _ -(xh* A 
dx dx dx dx' 



(3.43) 



where use has been made of Eq. (3.42). It follows, by comparison with Eqs. (2.35) 
and (3.36), that 

/ d \ f _ _ d 
dx' 



dx 



(3.44) 



40 



3.4 Schrddinger's representation - 1 



3 POSITION AND MOMENTUM 



Thus, d/dx is an anti-Hermitian operator. 

Let us evaluate the commutation relation between the operators x and d/dx. 
We have 

Since this holds for any ket it follows that 

A X _ X A = -|. (3.46) 
dx dx 

Let p be the momentum conjugate to x (for the simple system under consideration 
p is a straight-forward linear momentum). According to Eq. (3.25), x and p 
satisfy the commutation relation 

xp — px = i?i. (3.47) 

It can be seen, by comparison with Eq. (3.46), that the Hermitian operator 
— ihd/dx satisfies the same commutation relation with x that p does. The most 
general conclusion which may be drawn from a comparison of Eqs. (3.46) and 
(3.47) is that 

p =-ifiA + f(x), (3.48) 
dx 

since (as is easily demonstrated) a general function f (x) of the position operator 
automatically commutes with x. 

We have chosen to normalize the eigenkets and eigenbras of the position oper- 
ator so that they satisfy the normalization condition (3.26). However, this choice 
of normalization does not uniquely determine the eigenkets and eigenbras. Sup- 
pose that we transform to a new set of eigenbras which are related to the old set 
via 

(x / |new = e iY '(x / | old , (3.49) 

where y' = y(x') is a real function of x'. This transformation amounts to a 
rearrangement of the relative phases of the eigenbras. The new normalization 
condition is 

(xWw = (x / |e i ^e- i ^"|x // ) old = e i ^'^" ) (x / |x // )o ld 

= e i(y '- y " ) 6(x / -x // ) =6(x / -x"). (3.50) 
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Thus, the new eigenbras satisfy the same normalization condition as the old 
eigenbras. 

By definition, the standard ket } satisfies (x'|) = 1. It follows from Eq. (3.49) 
that the new standard ket is related to the old standard ket via 



new 



e iY )oid, (3.51) 



where y = y(x) is a real function of the position operator x. The dual of the 
above equation yields the transformation rule for the standard bra, 



new - 



oi d e iT . (3.52) 



The transformation rule for a general operator A follows from Eqs. (3.51) and 
(3.52), plus the requirement that the triple product (A) remain invariant (this 
must be the case, otherwise the probability of a measurement yielding a certain 
result would depend on the choice of eigenbras) . Thus, 

Anew = e- iY A old e^. (3.53) 

Of course, if A commutes with x then A is invariant under the transformation. In 
fact, d/dx is the only operator (we know of) which does not commute with x, so 
Eq. (3.53) yields 

(A.) =e -^d e , T d +i dy 

Vdx/new dx dx dx 

where the subscript "old" is taken as read. It follows, from Eq. (3.48), that the 
momentum operator p can be written 

p = -ih(-^) -Ti^ + f(x). (3.55) 

\ (XX / new Q-X 



Thus, the special choice 



yields 



Fiy(x) 



f (x) dx (3.56) 



. (3.57) 



\dxj 



new 
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Equation (3.56) fixes y to within an arbitrary additive constant: i.e., the special 
eigenkets and eigenbras for which Eq. (3.57) is true are determined to within an 
arbitrary common phase-factor. 

In conclusion, it is possible to find a set of basis eigenkets and eigenbras of 
the position operator x which satisfy the normalization condition (3.26), and for 
which the momentum conjugate to x can be represented as the operator 



A general state ket is written ip(x)), where the standard ket ) satisfies (x'|) = 
1, and where ip(x') = (x'\^\)[x)) is the wave-function. This scheme of things is 
known as Schrddinger's representation, and is the basis of wave mechanics. 

3.5 Schrodinger's representation - II 

In the preceding sections, we have developed Schrodinger's representation for 
the case of a single operator x corresponding to a classical Cartesian coordinate. 
However, this scheme can easily be extended. Consider a system with N general- 
ized coordinates, qi • • • q^, which can all be simultaneously measured. These are 
represented as N commuting operators, qi • • • cJnj each with a continuous range 
of eigenvalues, q\ ■ ■ ■ q^. Ket space is conveniently spanned by the simultaneous 
eigenkets of qi • • • q^, which are denoted \q[ ■ ■ ■ q^). These eigenkets must form 
a complete set, otherwise the qi • ■ ■ would not be simultaneously observable. 

The orthogonality condition for the eigenkets [i.e., the generalization of Eq. (3.26)] 



P 




(3.58) 



rs 




(3.59) 



dqj •••dq;jq 1 / ---q; j }(q 1 '---q; j 



1. 



(3.60) 



oo 



oo 



The standard ket ) is defined such that 



(3.61) 
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The standard bra ( is the dual of the standard ket. A general state ket is written 

iMqi---q N )). (3.62) 

The associated wave-function is 

iMqi-'-q^Hqi-'-q^}. (3-63) 
Likewise, a general state bra is written 

(cKqT--q N ), (3-64) 

where 

*(qi---qN) = <*lqi---qN>- (3-65) 
The probability of an observation of the system finding the first coordinate in the 
range qj to q{ + dqj, the second coordinate in the range q' 2 to q' 2 + dq^, etc., is 

P(q{ • • • q^; dqj • • • dq(J = N>(q{ • • • q^| 2 dqj • • • dq^. (3.66) 

Finally, the normalization condition for a physical wave-function is 



f +0O 



-oo 



iKqi---q^| 2 dqi---dq; = l. (3.67) 



-oo 



The N linear operators 3/3qi (where i runs from 1 to N) are defined 

%> = ^>. (3.68) 



3qi 3qi' 

These linear operators can also act on bras (provided the associated wave-functions 
are square integrable) in accordance with [see Eq. (3.42)] 

= -<|t (3.69) 
3qi 3qi 

Corresponding to Eq. (3.46), we can derive the commutation relations 

^ 3 

qi-q^ = 6ii. (3.70) 



3qr J ^3qi 
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It is also clear that 



3 9 v 9 2 ib v 3 3 v 



showing that 



a a a a 



dqidq, 3qj 3qt* 



(3.72) 



It can be seen, by comparison with Eqs. (3.23)-(3.25), that the linear oper- 
ators — i hd/dqi satisfy the same commutation relations with the q's and with 
each other that the p's do. The most general conclusion we can draw from this 
coincidence of commutation relations is (see Dirac) 

Pt = -ih— + . (3.73) 

However, the function F can be transformed away via a suitable readjustment of 
the phases of the basis eigenkets (see Sect. 3.4, and Dirac). Thus, we can always 
construct a set of simultaneous eigenkets of qi ■ • ■ qN for which 

Pi = -ih— . (3.74) 

3qi 

This is the generalized Schrodinger representation. 
It follows from Eqs. (3.61), (3.68), and (3.74) that 

Pi) = 0. (3.75) 

Thus, the standard ket in Schrodinger's representation is a simultaneous eigenket 
of all the momentum operators belonging to the eigenvalue zero. Note that 

(Pi ■ ■ • = «,'- = 3 "" q ' a ;- q " = {~M\ • • • (3.76) 

3qi 3qi 3qj 3q( 



Hence, 



so that 



<qi---qNl^ = ^(qi---qNl> (3.77) 
3qi 3qi 

g 

qi , ---qNlPi = -i^g q 7<qi / ---qNl- (3.78) 



u 
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The dual of the above equation gives 



Pt|q{---qN>=iTi— 7 |q;---q;>. (3.79) 



3.6 The momentum representation 

Consider a system with one degree of freedom, describable in terms of a coordi- 
nate x and its conjugate momentum p, both of which have a continuous range 
of eigenvalues. We have seen that it is possible to represent the system in terms 
of the eigenkets of x. This is termed Schrodinger's representation. However, it is 
also possible to represent the system in terms of the eigenkets of p. 

Consider the eigenkets of p which belong to the eigenvalues p\ These are 
denoted |p'). The orthogonality relation for the momentum eigenkets is 

<pV>=S(p'-p"), (3-80) 
and the corresponding completeness relation is 

dpV><p'l = 1. (3.81) 
A general state ket can be written 

*(p)> (3.82) 

where the standard ket } satisfies 

(p'l> = 1. (3.83) 

Note that the standard ket in this representation is quite different to that in 
Schrodinger's representation. The momentum space wave-function (Mp') sat- 
isfies 

cMp') = (pi*). (3.84) 

The probability that a measurement of the momentum yields a result lying in the 
range p ' to p ' + dp ' is given by 

P(p',dp') = |*(p')l 2 dp'. (3.85) 



— oo 
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Finally, the normalization condition for a physical momentum space wave-function 
is 



-oo 

J\\2 a„i 



4>(p')|dp' = 1. (3.86) 

The fundamental commutation relations (3.23)-(3.25) exhibit a particular 
symmetry between coordinates and their conjugate momenta. If all the coor- 
dinates are transformed into their conjugate momenta, and vice versa, and i is 
then replaced by — i, the commutation relations are unchanged. It follows from 
this symmetry that we can always choose the eigenkets of p in such a manner 
that the coordinate x can be represented as (see Sect. 3.4) 

x = ih,-^. (3.87) 
dp 

This is termed the momentum representation. 

The above result is easily generalized to a system with more than one degree 
of freedom. Suppose the system is specified by N coordinates, pi • • • cjnj an d 
N conjugate momenta, pi • • ■ Pn- Then, in the momentum representation, the 
coordinates can be written as 

g 

qi = ih—. (3.88) 

We also have 

= 0, (3.89) 



and 

g 

(p 1 , ---p^|q i = iru— (v\---Vu\- (3-90) 

The momentum representation is less useful than Schrodinger's representa- 
tion for a very simple reason. The energy operator (i.e., the Hamiltonian) of 
most simple systems takes the form of a sum of quadratic terms in the momenta 
(i.e., the kinetic energy) plus a complicated function of the coordinates (i.e., the 
potential energy). In Schrodinger's representation, the eigenvalue problem for 
the energy translates into a second-order differential equation in the coordinates, 
with a complicated potential function. In the momentum representation, the 



47 



3.7 The uncertainty relation 



3 POSITION AND MOMENTUM 



problem transforms into a high-order differential equation in the momenta, with 
a quadratic potential. With the mathematical tools at our disposal, we are far bet- 
ter able to solve the former type of problem than the latter. Hence, Schrodinger's 
representation is generally more useful than the momentum representation. 



3.7 The uncertainty relation 



How is a momentum space wave-function related to the corresponding coordi- 
nate space wave-function? To answer this question, let us consider the represen- 
tative (x'lp') of the momentum eigenkets |p') in Schrodinger's representation for 
a system with a single degree of freedom. This representative satisfies 



p'(xV) 



ih^(xV), 



(3.91) 



where use has been made of Eq. (3.78) (for the case of a system with one degree 
of freedom) . The solution of the above differential equation is 



(x'lp'} = c'exp(ip V/Ti), 
where c' = c'(p'). It is easily demonstrated that 



<p'ip" 



poo 



(p'|x') dx'(x'|p" 



c'V 



The well-known mathematical result 

^+00 



exp(i ax) dx = 27t6(a), 

J — oo 

yields 

<p'|p") = |c'| 2 h6(p'-p"). 
This is consistent with Eq. (3.80), provided that c' = h 1/7 . Thus, 

(x'lp') = hr 1/2 exp(ip'x'/ri). 



(3.92) 



exp[-i (p' - p") x'/h] dx'. (3.93) 



(3.94) 



(3.95) 



(3.96) 
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Consider a general state ket |A) whose coordinate wave-function is "4>(x') 3 and 
whose momentum wave-function is W(p'). In other words, 



x'|A), 
(p'lA). 



(3.97) 
(3.98) 



It is easily demonstrated that 



HV2 



dpWXp'IA) 

¥(p')exp(ipV/ri)dp' 



-| r+oo 



(3.99) 



and 



dx'<pV)(*'IA) 



HV2 



iM*') exp(— ipV/Pi) dx', 



(3.100) 



where use has been made of Eqs. (3.27), (3.81), (3.94), and (3.96). Clearly, the 
momentum space wave-function is the Fourier transform of the coordinate space 
wave-function. 

Consider a state whose coordinate space wave-function is a wave-packet. In 
other words, the wave-function only has non-negligible amplitude in some spa- 
tially localized region of extent Ax. As is well-know, the Fourier transform of a 
wave-packet fills up a wave-number band of approximate extent 6k ~ 1 /Ax. Note 
that in Eq. (3.99) the role of the wave-number k is played by the quantity p '/h. It 
follows that the momentum space wave-function corresponding to a wave-packet 
in coordinate space extends over a range of momenta Ap ~ Pi/Ax. Clearly, a mea- 
surement of x is almost certain to give a result lying in a range of width Ax. 
Likewise, measurement of p is almost certain to yield a result lying in a range of 
width Ap. The product of these two uncertainties is 



Ax Ap ~ Pi. 



(3.101) 



This result is called Heisenberg's uncertainty principle. 
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Actually, it is possible to write Heisenberg's uncertainty principle more exactly 
by making use of Eq. (2.83) and the commutation relation (3.47). We obtain 

<(Ax) 2 )<(Ap) 2 } > ~ (3.102) 

for any general state. It is easily demonstrated that the minimum uncertainty 
states, for which the equality sign holds in the above relation, correspond to 
Gaussian wave-packets in both coordinate and momentum space. 



3.8 Displacement operators 

Consider a system with one degree of freedom corresponding to the Cartesian 
coordinate x. Suppose that we displace this system some distance along the x- 
axis. We could imagine that the system is on wheels, and we just give it a little 
push. The final state of the system is completely determined by its initial state, 
together with the direction and magnitude of the displacement. Note that the 
type of displacement we are considering is one in which everything to do with the 
system is displaced. So, if the system is subject to an external potential, then the 
potential must be displaced. 

The situation is not so clear with state kets. The final state of the system 
only determines the direction of the displaced state ket. Even if we adopt the 
convention that all state kets have unit norms, the final ket is still not completely 
determined, since it can be multiplied by a constant phase-factor. However, we 
know that the superposition relations between states remain invariant under the 
displacement. This follows because the superposition relations have a physical 
significance which is unaffected by a displacement of the system. Thus, if 

|R) = |A> + |B) (3.103) 

in the undisplaced system, and the displacement causes ket |R) to transform to 
ket |Rd), etc., then in the displaced system we have 

|Rd) = |Ad) + |Bd). (3.104) 
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Incidentally, this determines the displaced kets to within a single arbitrary phase- 
factor to be multiplied into all of them. The displaced kets cannot be multiplied 
by individual phase-factors, because this would wreck the superposition relations. 

Since Eq. (3.104) holds in the displaced system whenever Eq. (3.103) holds in 
the undisplaced system, it follows that the displaced ket |Rd) must be the result 
of some linear operator acting on the undisplaced ket |R). In other words, 

|Rd) =D|R), (3.105) 

where D an operator which depends only on the nature of the displacement. The 
arbitrary phase-factor by which all displaced kets may be multiplied results in D 
being undetermined to an arbitrary multiplicative constant of modulus unity. 

We now adopt the ansatz that any combination of bras, kets, and dynamical 
variables which possesses a physical significance is invariant under a displace- 
ment of the system. The normalization condition 

(A|A) = 1 (3.106) 

for a state ket |A) certainly has a physical significance. Thus, we must have 

(Ad|Ad) = 1. (3.107) 

Now, |Ad) = D|A) and (Ad| = (Apt, so 

(A|D f D|A) = 1. (3.108) 

Since this must hold for any state ket |A), it follows that 

D f D = l. (3.109) 

Hence, the displacement operator is unitary. Note that the above relation implies 
that 

|A) =D f |Ad). (3.110) 

The equation 

v|A) = |B), (3.111) 
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where the operator v represents a dynamical variable, has some physical signifi- 
cance. Thus, we require that 

v d |Ad) = |Bd>, (3.112) 

where v d is the displaced operator. It follows that 

v d |Ad) =D|B) =Dv|A) = DvD t |Ad). (3.113) 

Since this is true for any ket |Ad), we have 

v d = DvD t . (3.114) 

Note that the arbitrary numerical factor in D does not affect either of the results 
(3.109) and (3.114). 

Suppose, now, that the system is displaced an infinitesimal distance 6x along 
the x-axis. We expect that the displaced ket | Ad) should approach the undisplaced 
ket | A) in the limit as 6x — > 0. Thus, we expect the limit 

lim |Ad) ~ |A) = lim 5_I|A> (3.115) 

to exist. Let 

d*=lim5fJ-, (3.116) 

5x^0 OX 

where d x is denoted the displacement operator along the x-axis. The fact that D 
can be replaced by D exp(iy), where y is a real phase-angle, implies that d x can 
be replaced by 

Dexp(iy)-1 D-l+iy ronm 

lim - = hm — = d x + i a x , (3.117) 

6x^0 OX 6x^0 OX 

where a x is the limit of y/6x. We have assumed, as seems reasonable, thaty tends 
to zero as 6x — > 0. It is clear that the displacement operator is undetermined to 
an arbitrary imaginary additive constant. 

For small 6x, we have 

D = l+6xd x . (3.118) 
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It follows from Eq. (3.109) that 

(1 + 6xd x t )(1 + 6xd x ) = 1. (3.119) 
Neglecting order (6x) 2 , we obtain 

dj + d x = 0. (3.120) 

Thus, the displacement operator is anti-Hermitian. Substituting into Eq. (3.114), 
and again neglecting order (6x) 2 , we find that 

v d = (1 + 6x d x ) v (1 — 6x d x ) = v + 6x (d x v — v d x ), (3.121) 

which implies 

lim = d x v-vd x . (3.122) 

Let us consider a specific example. Suppose that a state has a wave-function 
iMx'). If the system is displaced a distance 6x along the x-axis then the new 
wave-function is ij^x' — 5x) (i.e., the same shape shifted in the x-direction by a 
distance 6x) . Actually, the new wave-function can be multiplied by an arbitrary 
number of modulus unity. It can be seen that the new wave-function is obtained 
from the old wave-function according to the prescription x' — > x' — 6x. Thus, 

Xd = x — 6x. (3.123) 

A comparison with Eq. (3.122), using x = v, yields 

d x x-xd x = -l. (3.124) 

It follows that iTid x obeys the same commutation relation with x that p x , the 
momentum conjugate to x, does [see Eq. (3.25)]. The most general conclusion 
we can draw from this observation is that 

p x = ihd x + f(x), (3.125) 

where f is Hermitian (since p x is Hermitian). However, the fact that d x is unde- 
termined to an arbitrary additive imaginary constant (which could be a function 
of x) enables us to transform the function f out of the above equation, leaving 

p x = ihd x . (3.126) 
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Thus, the displacement operator in the x-direction is proportional to the momen- 
tum conjugate to x. We say that p x is the generator of translations along the 
x-axis. 

A finite translation along the x-axis can be constructed from a series of very 
many infinitesimal translations. Thus, the operator D(Ax) which translates the 
system a distance Ax along the x-axis is written 



The unitary nature of the operator is now clearly apparent. 

We can also construct displacement operators which translate the system along 
the y- and z-axes. Note that a displacement a distance Ax along the x-axis com- 
mutes with a displacement a distance Ay along the u-axis. In other words, if the 
system is moved Ax along the x-axis, and then Ay along the u-axis then it ends 
up in the same state as if it were moved Ay along the u-axis, and then Ax along 
the x-axis. The fact that translations in independent directions commute is clearly 
associated with the fact that the conjugate momentum operators associated with 
these directions also commute [see Eqs. (3.24) and (3.128)]. 




(3.127) 



where use has been made of Eqs. (3.118) and (3.126). It follows that 



D(Ax) = exp (— ip x Ax/Ti) . 



(3.128) 
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4 Quantum dynamics 

4. 1 Schrodinger's equations of motion 

Up to now, we have only considered systems at one particular instant of time. Let 
us now investigate how quantum mechanical systems evolve with time. 

Consider a system in a state A which evolves in time. At time t the state of the 
system is represented by the ket |At). The label A is needed to distinguish the 
ket from any other ket (|Bt), say) which is evolving in time. The label t is needed 
to distinguish the different states of the system at different times. 

The final state of the system at time t is completely determined by its initial 
state at time to plus the time interval t — to (assuming that the system is left 
undisturbed during this time interval) . However, the final state only determines 
the direction of the final state ket. Even if we adopt the convention that all state 
kets have unit norms, the final ket is still not completely determined, since it 
can be multiplied by an arbitrary phase-factor. However, we expect that if a 
superposition relation holds for certain states at time to then the same relation 
should hold between the corresponding time-evolved states at time t, assuming 
that the system is left undisturbed between times to and t. In other words, if 

|Rto> = |At ) + |Bto> (4.1) 
for any three kets, then we should have 

|Rt) = |At) + |Bt). (4.2) 

This rule determines the time-evolved kets to within a single arbitrary phase- 
factor to be multiplied into all of them. The evolved kets cannot be multiplied by 
individual phase-factors, since this would invalidate the superposition relation at 
later times. 

According to Eqs. (4.1) and (4.2), the final ket |Rt) depends linearly on the 
initial ket |Rto). Thus, the final ket can be regarded as the result of some linear 
operator acting on the initial ket: , i.e., 

|Rt) =T|Rto>, (4.3) 
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where T is a linear operator which depends only on the times t and to. The 
arbitrary phase-factor by which all time evolved kets may be multiplied results 
in T(t, to) being undetermined to an arbitrary multiplicative constant of modulus 
unity. 

Since we have adopted a convention in which the norm of any state ket is 
unity it make sense to define the time evolution operator T in such a manner 
that it preserves the length of any ket upon which it acts (i.e., if a ket is prop- 
erly normalized at time t then it will remain normalized at all subsequent times 
t > to). This is always possible, since the length of a ket possesses no physical 
significance. Thus, we require that 



Hence, the time evolution operator T is a unitary operator. 

Up to now, the time evolution operator T looks very much like the spatial 
displacement operator D introduced in the previous section. However, there are 
some important differences between time evolution and spatial displacement. 
In general, we do expect the expectation value of some observable £, to evolve 
with time, even if the system is left in a state of undisturbed motion (after all, 
time evolution has no meaning unless something observable changes with time) . 
The triple product (A|£JA) can evolve either because the ket |A) evolves and the 
operator £, stays constant, the ket |A) stays constant and the operator £, evolves, 
or both the ket |A) and the operator £, evolve. Since we are already committed to 
evolving state kets, according to Eq. (4.3), let us assume that the time evolution 
operator T can be chosen in such a manner that the operators representing the 
dynamical variables of the system do not evolve in time (unless they contain some 
specific time dependence). 

We expect, from physical continuity, that as t — > to then |At) — > |Ato) for any 
ket A. Thus, the limit 



(AtolAto) 

for any ket A, which immediately yields 

TtT 



(At|At) 



1. 



(4.4) 



(4.5) 



lim 

t-rto 



At) - |At ) 
t-t 



lim 

t-»to t - t 



At ) 



(4.6) 
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should exist. Note that this limit is simply the derivative of |Ato) with respect to 
to- Let 

T(t,t )-1 

T(to) = lun — - — . (4.7) 

t— >t t — to 

It is easily demonstrated from Eq. (4.5) that t is anti-Hermitian: i.e., 

T f + t = 0. (4.8) 

The fact that T can be replaced by Texp(iy) (where y is real) implies that t is 
undetermined to an arbitrary imaginary additive constant (see previous section). 
Let us define the Hermitian operator H(to) = i fir. This operator is undetermined 
to an arbitrary real additive constant. It follows from Eqs. (4.6) and (4.7) that 

. h d\A±ol = [h]im I At) ~ < At o) = i rtT(to)IAto) = H(to)IAto). (4.9) 
ato t-^to t — to 

When written for general t this equation becomes 

iFL^!M = H(t)|At>. (4.10) 
dt 

Equation (4.10) gives the general law for the time evolution of a state ket in 
a scheme in which the operators representing the dynamical variables remain 
fixed. This equation is denoted Schrddinger's equation of motion. It involves a 
Hermitian operator H(t) which is, presumably, a characteristic of the dynamical 
system under investigation. 

We saw, in the previous section, that if the operator D(x,xo) displaces the 
system along the x-axis from xo to x then 

p^iKlim 13 ^- 1 , (4.11) 

x^x x — Xo 

where p x is the operator representing the momentum conjugate to x. We now 
have that if the operator T(t, to) evolves the system in time from to to t then 

H(to)=iTilim T(t ; to) ~ 1 . (4.12) 

t-»t t — to 
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Thus, the dynamical variable corresponding to the operator H stands to time t as 
the momentum p x stands to the coordinate x. By analogy with classical physics, 
this suggests that H(t) is the operator representing the total energy of the system. 
(Recall that, in classical physics, if the equations of motion of a system are in- 
variant under an x-displacement of the system then this implies that the system 
conserves momentum in the x-direction. Likewise, if the equations of motion are 
invariant under a temporal displacement then this implies that the system con- 
serves energy.) The operator H(t) is usually called the Hamiltonian of the system. 
The fact that the Hamiltonian is undetermined to an arbitrary real additive con- 
stant is related to the well-known phenomenon that energy is undetermined to 
an arbitrary additive constant in physics (i.e., the zero of potential energy is not 
well-defined). 

Substituting |At) = T|At ) into Eq. (4.10) yields 

ih^lAto) =H(t)T|At >. (4.13) 
Since this must hold for any initial state |Ato) we conclude that 

ih^ = H(t)T. (4.14) 
at 

This equation can be integrated to give 

T(t,t ) = exp -i 



H(t') dt'/Fi) , (4.15) 

to 



where use has been made of Eqs. (4.5) and (4.6). (Here, we assume that Hamil- 
tonian operators evaluated at different times commute with one another). It is 
now clear how the fact that H is undetermined to an arbitrary real additive con- 
stant leaves T undetermined to a phase-factor. Note that, in the above analysis, 
time is not an operator (we cannot observe time, as such), it is just a parameter 
(or, more accurately, a continuous label). Since we are only dealing with non- 
relativistic quantum mechanics, the fact that position is an operator, but time is 
only a label, need not worry us unduly. In relativistic quantum mechanics, time 
and space coordinates are treated on the same footing by relegating position from 
being an operator to being just a label. 
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4.2 Heisenberg's equations of motion 

We have seen that in Schrodinger's scheme the dynamical variables of the system 
remain fixed during a period of undisturbed motion, whereas the state kets evolve 
according to Eq. (4.10). However, this is not the only way in which to represent 
the time evolution of the system. 

Suppose that a general state ket A is subject to the transformation 

|At>=T+(t,to)|A). (4.16) 

This is a time-dependent transformation, since the operator T(t,to) obviously 
depends on time. The subscript t is used to remind us that the transformation is 
time-dependent. The time evolution of the transformed state ket is given by 

|A t t) = V{t t tQ)\At) =T t (t,to)T(t,to)|At ) = |A t t ), (4.17) 

where use has been made of Eqs. (4.3), (4.5), and the fact that T(to,to) = 1. 
Clearly, the transformed state ket does not evolve in time. Thus, the transforma- 
tion (4.16) has the effect of bringing all kets representing states of undisturbed 
motion of the system to rest. 

The transformation must also be applied to bras. The dual of Eq. (4.16) yields 

(A t | = (A|T. (4.18) 

The transformation rule for a general observable v is obtained from the require- 
ment that the expectation value (A|v|A) should remain invariant. It is easily seen 
that 

v t = T t vT. (4.19) 

Thus, a dynamical variable, which corresponds to a fixed linear operator in Schro- 
dinger's scheme, corresponds to a moving linear operator in this new scheme. It 
is clear that the transformation (4.16) leads us to a scenario in which the state 
of the system is represented by a fixed vector, and the dynamical variables are 
represented by moving linear operators. This is termed the Heisenberg picture, as 
opposed to the Schrddinger picture, which is outlined in Sect. 4.1. 
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Consider a dynamical variable v corresponding to a fixed linear operator in the 
Schrodinger picture. According to Eq. (4.19), we can write 

Tv t = vT. (4.20) 

Differentiation with respect to time yields 

dT dv t dT , ~ 

dT Vt + T dF = v dT (4 - 21) 

With the help of Eq. (4.14), this reduces to 

HTv t + iPiT— =vHT, (4.22) 
dt 

or 

iFi^ = T t vHT-TtHTv t = v t H t -H t v t , (4.23) 
dt 

where 

H t = T t HT (4.24) 

Equation (4.23) can be written 

ih ^T = [ v t> H t]- (4-25) 
dt 

Equation (4.25) shows how the dynamical variables of the system evolve in 
the Heisenberg picture. It is denoted Heisenberg's equation of motion. Note that 
the time-varying dynamical variables in the Heisenberg picture are usually called 
Heisenberg dynamical variables to distinguish them from Schrodinger dynamical 
variables (i.e., the corresponding variables in the Schrodinger picture), which do 
not evolve in time. 

According to Eq. (3.22), the Heisenberg equation of motion can be written 

dv t 

-TjT = [v t ,H t J quantum, (4.26) 

where [■ • ■] quantum denotes the quantum Poisson bracket. Let us compare this equa- 
tion with the classical time evolution equation for a general dynamical variable 
v, which can be written in the form [see Eq. (3.7)] 

dv 

-T7 = [v, H] classical . (4.27) 
dt 
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Here, [• • ■] classical is the classical Poisson bracket, and H denotes the classical 
Hamiltonian. The strong resemblance between Eqs. (4.26) and (4.27) provides 
us with further justification for our identification of the linear operator H with 
the energy of the system in quantum mechanics. 

Note that if the Hamiltonian does not explicitly depend on time (i.e., the sys- 
tem is not subject to some time-dependent external force) then Eq. (4.15) yields 

T(t, t ) = exp [-i H (t - 1 )/ h] . (4.28) 

This operator manifestly commutes with H, so 

H t = T t HT = H. (4.29) 

Furthermore, Eq. (4.25) gives 

ih 6 ^ = [H,H] = 0. (4.30) 
dt 

Thus, if the energy of the system has no explicit time-dependence then it is rep- 
resented by the same non-time-varying operator H in both the Schrodinger and 
Heisenberg pictures. 

Suppose that v is an observable which commutes with the Hamiltonian (and, 
hence, with the time evolution operator T). It follows from Eq. (4.19) that v t = v. 
Heisenberg's equation of motion yields 

ih^ = [v,H]=0. (4.31) 
at 

Thus, any observable which commutes with the Hamiltonian is a constant of the mo- 
tion (hence, it is represented by the same fixed operator in both the Schrodinger 
and Heisenberg pictures). Only those observables which do not commute with 
the Hamiltonian evolve in time in the Heisenberg picture. 



4.3 Ehrenfest's theorem 

We have now derived all of the basic elements of quantum mechanics. The only 
thing which is lacking is some rule to determine the form of the quantum mechan- 
ical Hamiltonian. For a physical system which possess a classical analogue, we 



61 



4.3 Ehrenfest's theorem 



4 QUANTUM DYNAMICS 



generally assume that the Hamiltonian has the same form as in classical physics 
(i.e., we replace the classical coordinates and conjugate momenta by the corre- 
sponding quantum mechanical operators) . This scheme guarantees that quantum 
mechanics yields the correct classical equations of motion in the classical limit. 
Whenever an ambiguity arises because of non-commuting observables, this can 
usually be resolved by requiring the Hamiltonian H to be an Hermitian oper- 
ator. For instance, we would write the quantum mechanical analogue of the 
classical product xp, appearing in the Hamiltonian, as the Hermitian product 
(1 /2) (xp + px). When the system in question has no classical analogue then we 
are reduced to guessing a form for H which reproduces the observed behaviour 
of the system. 

Consider a three-dimensional system characterized by three independent Carte- 
sian position coordinates Xi (where i runs from 1 to 3), with three corresponding 
conjugate momenta pi. These are represented by three commuting position op- 
erators Xi, and three commuting momentum operators pi, respectively. The com- 
mutation relations satisfied by the position and momentum operators are [see 
Eq. (3.25)] 

[xi.pj] =irt6 tj . (4.32) 

It is helpful to denote (xi,X2,X3) as x and (pi,P2,P3) as p. The following useful 
formulae, 

[xi,F(p)] = ih— , (4.33) 
3pi 

[pi,G(x)] = -ih— , (4.34) 

OXi 

where F and G are functions which can be expanded as power series, are easily 
proved using the fundamental commutation relations Eq. (4.32). 

Let us now consider the three-dimensional motion of a free particle of mass m 
in the Heisenberg picture. The Hamiltonian is assumed to have the same form as 
in classical physics: 

H = f- = — ^ Pi 2 . (4.35) 
2 m 2m 

X—\ 
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In the following, all dynamical variables are assumed to be Heisenberg dynamical 
variables, although we will omit the subscript t for the sake of clarity. The time 
evolution of the momentum operator follows from Heisenberg's equation of 
motion (4.25). We find that 



dpi 



1 



dt i ft 



[pt,H] =0, 



(4.36) 



since pi automatically commutes with any function of the momentum operators. 
Thus, for a free particle the momentum operators are constants of the motion, 
which means that Pi(t) = Pi(0) at all times t (for i is 1 to 3). The time evolution 
of the position operator X{ is given by 



dxt i i i a 

dt iTi irizm opt 



5> 



Pi 
m 



Pi(0) 



m 



(4.37) 



where use has been made of Eq. (4.33). It follows that 

Pi(0)' 



xt(t) = xt(O) + 



m 



t, 



(4.38) 



which is analogous to the equation of motion of a classical free particle. Note 
that even though 

[x t (0),x J (0)] = 0, (4.39) 

where the position operators are evaluated at equal times, the Xt do not commute 
when evaluated at different times. For instance, 



[xi(t),Xi(0)] 



pt(0)t 



m 



MO) 



-iht 



m 



(4.40) 



Combining the above commutation relation with the uncertainty relation (2.83) 
yields 

((Ax i ) 2 ) t ((Ax i ) 2 ) t=0 > ( 4 - 41 ) 

This result implies that even if a particle is well-localized at t = 0, its position 
becomes progressively more uncertain with time. This conclusion can also be 
obtained by studying the propagation of wave-packets in wave mechanics. 
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Let us now add a potential V(x) to our free particle Hamiltonian: 



H 



2m 



+ V(x). 



(4.42) 



Here, V is some function of the Xi operators. Heisenberg's equation of motion 
gives 

dpi 1 r w , 3V(x) 



[pt,V(x)] 



dt i Pi 3xt 
where use has been made of Eq. (4.34). On the other hand, the result 

dx t _ p t 
dt m 



(4.43) 



(4.44) 



still holds, because the Xi all commute with the new term V(x) in the Hamilto- 
nian. We can use the Heisenberg equation of motion a second time to deduce 
that 



d z xt 



1 

Th 



dxt 
dt 



H 



1 

ih 



Pi 
m' 



H 



In vectorial form, this equation becomes 

d 2 x _ dp 

dt 1 = dt 



m 



^dpi 
m dt 



-VV(x). 



i av(x) 

m 3xi 



(4.45) 



(4.46) 



This is the quantum mechanical equivalent of Newton's second law of motion. 
Taking the expectation values of both sides with respect to a Heisenberg state ket 
that does not move with time, we obtain 



m 



d^x) 
dt 2 



dt 



-<VV(x)>. 



(4.47) 



This is known as Ehrenfest's theorem. When written in terms of expectation 
values, this result is independent of whether we are using the Heisenberg or 
Schrodinger picture. In contrast, the operator equation (4.46) only holds if x and 
p are understood to be Heisenberg dynamical variables. Note that Eq. (4.47) has 
no dependence on Pi. In fact, it guarantees to us that the centre of a wave-packet 
always moves like a classical particle. 
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4.4 Schrodinger's wave-equation 



Let us now consider the motion of a particle in three dimensions in the Schrodinger 
picture. The fixed dynamical variables of the system are the position operators 
x = (xi,X2,X3), and the momentum operators p = (pi,P2,P3). The state of the 
system is represented as some time evolving ket |At). 

Let |x') represent a simultaneous eigenket of the position operators belonging 
to the eigenvalues x' = (x^x^x^). Note that, since the position operators are 
fixed in the Schrodinger picture, we do not expect the |x') to evolve in time. The 
wave-function of the system at time t is defined 

i|>(x',t) = (x'|At). (4.48) 

The Hamiltonian of the system is taken to be 

p 2 

H = f +Vx). (4.49) 
2m 

Schrodinger's equation of motion (4.10) yields 

. h 3(xVu) = (4>50) 
ot 

where use has been made of the time independence of the |x'). We adopt Schrod- 
inger's representation in which the momentum conjugate to the position operator 
Xi is written [see Eq. (3.74)] 

Pi = -ih— . (4.51) 
3xi 

Thus, 

h 2 



x' 



Pi 

2 m 



At >=-(f^l V'VlAt), (4.52) 



where use has been made of Eq. (3.78). Here, V' = (3/3x', d/dy' } d/dz') denotes 
the gradient operator written in terms of the position eigenvalues. We can also 
write 

(x'|V(x) = V(x')(x'|, (4.53) 
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where V(x') is a scalar function of the position eigenvalues. Combining Eqs. (4.49), 
(4.50), (4.52), and (4.53), we obtain 

[h d(x'\At) = _ ( M v'VlAt) + V(x')<x'|At), (4.54) 
3t \^ m J 

which can also be written 

m WxVt) = _ v a Mx , t) + v(x')il)(x',t). (4.55) 

This is Schrodinger's famous wave-equation, and is the basis of wave mechanics. 
Note, however, that the wave-equation is just one of many possible representa- 
tions of quantum mechanics. It just happens to give a type of equation which we 
know how to solve. In deriving the wave-equation, we have chosen to represent 
the system in terms of the eigenkets of the position operators, instead of those 
of the momentum operators. We have also fixed the relative phases of the |x') 
according to Schrodinger's representation, so that Eq. (4.51) is valid. Finally, we 
have chosen to work in the Schrodinger picture, in which state kets evolve and 
dynamical variables are fixed, instead of the Heisenberg picture, in which the 
opposite is true. 

Suppose that the ket |At) is an eigenket of the Hamiltonian belonging to the 
eigenvalue H': 

H| At) = H / |At). (4.56) 
Schrodinger's equation of motion (4.10) yields 

. h d|At) =H 

dt 1 

This can be integrated to give 

|At) = exp[-iH'(t-to)/"Fi]|Ato). (4.58) 

Note that |At) only differs from |Ato) by a phase-factor. The direction of the 
vector remains fixed in ket space. This suggests that if the system is initially in an 
eigenstate of the Hamiltonian then it remains in this state for ever, as long as the 
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system is undisturbed. Such a state is called a stationary state. The wave-function 
of a stationary state satisfies 

i|)(x',t) =iKx , ,to)exp[-iH , (t-t )/ri]. (4.59) 



Substituting the above relation into Schrodinger's wave equation (4.55), we 
obtain 

/ p. 2 \ 

, , V ,2 -iMx') + (V(x') - E) iMx') = 0, (4.60) 
2 my 

where iM x ') = ^(x'jto), and E = H' is the energy of the system. This is 
Schrodinger's time-independent wave-equation. A bound state solution of the 
above equation, in which the particle is confined within a finite region of space, 
satisfies the boundary condition 

i|>o(x') -> as |x'| -> oo. (4.61) 

Such a solution is only possible if 

E< lim V(x'). (4.62) 

|x'|— >oo 

Since it is conventional to set the potential at infinity equal to zero, the above 
relation implies that bound states are equivalent to negative energy states. The 
boundary condition (4.61) is sufficient to uniquely specify the solution of Eq. (4.60). 

The quantity p(x', t), defined by 

p(x , ,t) = |iHx , ,t)| 2 , (4.63) 

is termed the probability density. Recall, from Eq. (3.30), that the probability of 
observing the particle in some volume element d 3 x' around position x' is propor- 
tional to p(x', t) d 3 x'. The probability is equal to p(x', t) d 3 x' if the wave-function 
is properly normalized, so that 

r p(x',t)dV = 1. (4.64) 

Schrodinger's time-dependent wave-equation, (4.55), can easily be written in 
the form of a conservation equation for the probability density: 

^ + V'-j = 0. (4.65) 
ot 
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The probability current j takes the form 

fih 



j(x',t) 



,2m, 



[o|>* V'o|> - (V'f) = Im(\p*V'ip). (4.66) 



We can integrate Eq. (4.65) over all space, using the divergence theorem, and the 
boundary condition p — > as |x'| — > oo, to obtain 



3 

at 



p(x',t) dV = 0. 



(4.67) 



Thus, Schrodinger's wave-equation conserves probability. In particular, if the 
wave-function starts off properly normalized, according to Eq. (4.64), then it 
remains properly normalized at all subsequent times. It is easily demonstrated 
that 

|3„/ _ <P>t 



j(x',t) dV 



m 



(4.68) 



where (p) t denotes the expectation value of the momentum evaluated at time t. 
Clearly, the probability current is indirectly related to the particle momentum. 

In deriving Eq. (4.65) we have, naturally, assumed that the potential V(x') is 
real. Suppose, however, that the potential has an imaginary component. In this 
case, Eq. (4.65) generalizes to 



9p 

at 



+ v'-j 



2Im(V) 

^h 



P, 



giving 



a 

at 



p(x',t) dV 



Pi 



Im 



V(x')p(x',t) dV. 



(4.69) 



(4.70) 



Thus, if Im(V) < then the total probability of observing the particle anywhere 
in space decreases monotonically with time. Thus, an imaginary potential can 
be used to account for the disappearance of a particle. Such a potential is often 
employed to model nuclear reactions in which incident particles can be absorbed 
by nuclei. 



The wave-function can always be written in the form 

iS(x',t)" 
h 



iMx',t) = VP(x',t) exp 



(4.71) 
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where p and S are both real functions. The interpretation of p as a probability 
density has already been given. What is the interpretation of S? Note that 

iP*V't|> = y/pV'{y/p) + (^j pV'S. (4.72) 
It follows from Eq. (4.66) that 

j = (4.73) 
m 

Thus, the gradient of the phase of the wave-function determines the direction of 
the probability current. In particular, the probability current is locally normal to 
the contours of the phase-function S. 

Let us substitute Eq. (4.71) into Schrodinger's time-dependent wave-equation. 
We obtain 

1 r h 2 V' 2 y/p + 2i W( VP) • V'S - v/p |V'S| 2 + i Vp V /2 S] + y/p V 



2m 



3^ as 



(4.74) 



Let us treat Pi as a small quantity. To lowest order, Eq. (4.74) yields 
3S(x',t) 1 



9t 2 m 



V'S(x', t) I 2 + V(x', t) = H(x', V'S, t) , (4.75) 



where H(x, p, t) is the Hamiltonian operator. The above equation is known as the 
Hamilton-Jacobi equation, and is one of the many forms in which we can write 
the equations of classical mechanics. In classical mechanics, S is the action (i.e., 
the path-integral of the Lagrangian). Thus, in the limit Pi — > 0, wave mechanics 
reduces to classical mechanics. It is a good approximation to neglect the terms 
involving Pi in Eq. (4.74) provided that 

Pi|V /2 S| <C |V'S| 2 . (4.76) 

Note that, according to Eq. (4.71), 

A = Wsv (4 - 77) 
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where A is the de Broglie wave-length divided by 2n. The inequality (4.76) is 
equivalent to 

|V'A|<1. (4.78) 

In other words, quantum mechanics reduces to classical mechanics whenever 
the de Broglie wave-length is small compared to the characteristic distance over 
which things (other than the quantum phase) vary. This distance is usually set by 
the variation scale-length of the potential. 
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5 Angular momentum 

5.1 Orbital angular momentum 

Consider a particle described by the Cartesian coordinates (x,y,z) = r and their 
conjugate momenta (p x , p y , p z ) = p. The classical definition of the orbital angular 
momentum of such a particle about the origin is L = r x p, giving 

L x = up z -zpy, (5.1) 
L y = zp x -xp z , (5.2) 
U = KVy-VVx- (5.3) 

Let us assume that the operators (L x , L y , L z ) = L which represent the compo- 
nents of orbital angular momentum in quantum mechanics can be defined in an 
analogous manner to the corresponding components of classical angular momen- 
tum. In other words, we are going to assume that the above equations specify 
the angular momentum operators in terms of the position and linear momentum 
operators. Note that L x , L y , and L z are Hermitian, so they represent things which 
can, in principle, be measured. Note, also, that there is no ambiguity regard- 
ing the order in which operators appear in products on the right-hand sides of 
Eqs. (5.1)-(5.3), since all of the products consist of operators which commute. 

The fundamental commutation relations satisfied by the position and linear 
momentum operators are [see Eqs. (3.23)-(3.25)] 

[x^] = 0, (5.4) 
[pt,Pj] = 0, (5.5) 
[Xi,pj = ihSij, (5.6) 

where i and j stand for either x, y, or z. Consider the commutator of the operators 
L x and L z : 

[Lx, Ly] = [(uPz-zpy), (zp x -xp z )] =y [p z ,z]p x + xp y [z,p z ] 

= i h[-y p x + xp y ) = i hL z . (5.7) 
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The cyclic permutations of the above result yield the fundamental commutation 
relations satisfied by the components of an angular momentum: 

[L X ,LJ = i?iL z , (5.8) 
[Lu,U = iFiL x , (5.9) 
[L Z ,LJ = i?iL y . (5.10) 

These can be summed up more succinctly by writing 

L x L = i PiL. (5.11) 

The three commutation relations (5.8)-(5.10) are the foundation for the whole 
theory of angular momentum in quantum mechanics. Whenever we encounter 
three operators having these commutation relations, we know that the dynamical 
variables which they represent have identical properties to those of the compo- 
nents of an angular momentum (which we are about to derive) . In fact, we shall 
assume that any three operators which satisfy the commutation relations (5.8)- 
(5.10) represent the components of an angular momentum. 

Suppose that there are N particles in the system, with angular momentum 
vectors L t (where i runs from 1 to N). Each of these vectors satisfies Eq. (5.11), 
so that 

Li x Li = i FiLi. (5.12) 

However, we expect the angular momentum operators belonging to different par- 
ticles to commute, since they represent different degrees of freedom of the sys- 
tem. So, we can write 

Li x Lj + Lj x Li = 0, (5.13) 

for i 7^ j. Consider the total angular momentum of the system, L = Y-iL] ^i- It is 
clear from Eqs. (5.12) and (5.13) that 

N N N -j N 

L x L = y~ Lj x ^~ Lj = ^~ L t x Lj + - ^~ (Lj x Lj + Lj x L t ) 

i=1 j=1 i=1 x,j=1 

N 

= irt^Li = i?iL. (5.14) 
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Thus, the sum of two or more angular momentum vectors satisfies the same 
commutation relation as a primitive angular momentum vector. In particular, 
the total angular momentum of the system satisfies the commutation relation 
(5.11). 

The immediate conclusion which can be drawn from the commutation rela- 
tions (5.8)-(5.10) is that the three components of an angular momentum vector 
cannot be specified (or measured) simultaneously. In fact, once we have speci- 
fied one component, the values of other two components become uncertain. It is 
conventional to specify the z-component, L z . 

Consider the magnitude squared of the angular momentum vector, L 2 = L x 2 + 
L y 2 + L z 2 . The commutator of L 2 and L z is written 

[L 2 , LJ = [L x 2 , LJ + [L y 2 , L J + [L z 2 , LJ . (5.15) 

It is easily demonstrated that 

[L X 2 ,L Z ] = -i?i(L x L y + L y L x ), (5.16) 

[L y 2 ,L z ] = +iPi(L x L y + L y L x ), (5.17) 

[L Z 2 ,L Z ] = 0, (5.18) 

so 

[L 2 ,L z ]=0. (5.19) 

Since there is nothing special about the z-axis, we conclude that L 2 also commutes 
with L x and L y . It is clear from Eqs. (5.8)-(5.10) and (5.19) that the best we can 
do in quantum mechanics is to specify the magnitude of an angular momentum 
vector along with one of its components (by convention, the z-component) . 

It is convenient to define the shift operators L + and L : 

L + = L x + iLy, (5.20) 

L = L x -iLy. (5.21) 

Note that 

[L + ,L Z ] = -PiL + , (5.22) 
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[L " LJ = +PiL , (5.23) 
[L+L-] = 2PiL z . (5.24) 

Note, also, that both shift operators commute with L 2 . 



5.2 Eigenvalues of angular momentum 

Suppose that the simultaneous eigenkets of L 2 and L z are completely specified by 
two quantum numbers, I and m. These kets are denoted |l,m). The quantum 
number m is defined by 

L z |l, m) = m Ti|l, m). (5.25) 

Thus, m is the eigenvalue of L z divided by Pi. It is possible to write such an 
equation because Pi has the dimensions of angular momentum. Note that m is a 
real number, since L z is an Hermitian operator. 

We can write 

L 2 |l, m ) = f(l,m)Pi 2 |l,m), (5.26) 

without loss of generality, where f(l,m) is some real dimensionless function of I 
and m. Later on, we will show that f (I, m) = I (I + 1 ). Now, 

(I, m|L 2 - L z 2 |l, m) = (I, m|f (I, m) Pi 2 - m 2 fi 2 |l, m) = [f (I, m) - m 2 ] Pi 2 , (5.27) 

assuming that the |l, m) have unit norms. However, 

(l,m|L 2 -L z 2 |l,m> = (l,m|L x 2 + L y 2 |l,m) 

= (l,m|L x 2 |l,m) + (l,m|Ly 2 |l,m>. (5.28) 

It is easily demonstrated that 

(A|£, 2 |A) > 0, (5.29) 

where |A) is a general ket, and £, is an Hermitian operator. The proof follows 
from the observation that 

(A|£, 2 |A) = (A|£, f £JA) = <B|B), (5.30) 
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where |B) = £J A), plus the fact that (B|B) > for a general ket |B) [seeEq. (2.21)]. 
It follows from Eqs. (5.27)-(5.29) that 

m 2 <f(l,m). (5.31) 

Consider the effect of the shift operator L + on the eigenket |l,m). It is easily 
demonstrated that 

L 2 (L + |l,m)) = h 2 f(l,m) (L + |l,m», (5.32) 

where use has been made of Eq. (5.26), plus the fact that L 2 and L z commute. It 
follows that the ket L + |l, m) has the same eigenvalue of L 2 as the ket |l, m). Thus, 
the shift operator L + does not affect the magnitude of the angular momentum of 
any eigenket it acts upon. Note that 

L z L+|l,m) = (L + L z +[L z ,L+])|l,m) = (L+L 2 + TiL + )|l, m) 

= (m+1) FiL + |l,m), (5.33) 

where use has been made of Eq. (5.22). The above equation implies that L + |l, m) 
is proportional to |l, m + 1 }. We can write 

L+|l,m) = cj; m 7i|l,m+1), (5.34) 

where c^ m is a number. It is clear that when the operator L + acts on a simulta- 
neous eigenstate of L 2 and L z , the eigenvalue of L 2 remains unchanged, but the 
eigenvalue of L z is increased by Pi. For this reason, L + is called a raising operator. 

Using similar arguments to those given above, it is possible to demonstrate 
that 

L~|l,m) = cf m Pi|l, m— 1). (5.35) 
Hence, L is called a lowering operator. 

The shift operators step the value of m up and down by unity each time they 
operate on one of the simultaneous eigenkets of L 2 and L z . It would appear, at 
first sight, that any value of m can be obtained by applying the shift operators a 
sufficient number of times. However, according to Eq. (5.31), there is a definite 
upper bound to the values that m 2 can take. This bound is determined by the 
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eigenvalue of L 2 [see Eq. (5.26)]. It follows that there is a maximum and a 
minimum possible value which m can take. Suppose that we attempt to raise 
the value of m above its maximum value m max . Since there is no state with 
m > m max , we must have 

L + |l,m max ) = |0). (5.36) 

This implies that 

L-L + |l,m max ) = |0). (5.37) 

However, 

L L+ = L x 2 + L y 2 + i [U, LJ = L 2 - L z 2 - Pi L z , (5.38) 
so Eq. (5.37) yields 

(L 2 - L z 2 - PiLJ|l,m max ) = |0). (5.39) 
The above equation can be rearranged to give 

L 2 |l, m max ) = (L z 2 + PiLJ|l, m max ) = m max (m max + 1 ) h 2 |l, m max ). (5.40) 

Comparison of this equation with Eq. (5.26) yields the result 

f(l,m max ) = m max (m max + 1). (5.41) 

But, when L operates on |n, m max ) it generates |n, m max — 1 ), |n, m max — 2), etc. 
Since the lowering operator does not change the eigenvalue of L 2 , all of these 
states must correspond to the same value of f, namely m max (m max + 1 ). Thus, 

L 2 |l, m) = m max (m max + 1 ) h 2 |l, m) . (5.42) 

At this stage, we can give the unknown quantum number I the value m max , with- 
out loss of generality. We can also write the above equation in the form 

L 2 |l, m ) =1(1 + 1) Pi 2 |l,m). (5.43) 

It is easily seen that 

L _ L + |l,m) = (L 2 -L z 2 - FiL z )|l,m) = Fi 2 [l(l + 1 ) - m (m + 1 )]|l, m). (5.44) 
Thus, 

(I, m|L L + |l, m) = Fi 2 [l (I + 1 ) - m (m + 1 )]. (5.45) 
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However, we also know that 

(l,m|L _ L + |l,m) = (I, m|L TicJJl, m + 1) = Ti 2 cj m c^ m+1 , (5.46) 
where use has been made of Eqs. (5.34) and (5.35). It follows that 

c; m c u+1 = [l(H1)-m(m+1)]. (5.47) 

Consider the following: 

(I, m|L~|l, m + 1) = (I, m|L x |l, m + 1) — i (I, m|L y |l, m + 1) 

= (I, m + 1 |L x |l, m)* — i (I, m + 1 |L y |l, m)* 

= ((l,m+ 1|L x |l,m) + i (l,m+ l|L y |l,m))* 

= (l,m+1|L + |l,m)*, (5.48) 

where use has been made of the fact that L x and L y are Hermitian. The above 
equation reduces to 

<Wi = (c{J* (5.49) 
with the aid of Eqs. (5.34) and (5.35). 

Equations (5.47) and (5.49) can be combined to give 

|c+J 2 = [1(1+1) -m(m+1)]. (5.50) 

The solution of the above equation is 

c+ m = v/l(l+1)-m(m+l). (5.51) 

Note that c^ m is undetermined to an arbitrary phase-factor [i.e., we can replace 
cj m , given above, by c^ m exp(iy), where y is real, and we still satisfy Eq. (5.50)] . 
We have made the arbitrary, but convenient, choice that c^ m is real and posi- 
tive. This is equivalent to choosing the relative phases of the eigenkets |1, m). 
According to Eq. (5.49), 

Ci; m = (< m _! )* = VKl+1)-m(m-1). (5.52) 

We have already seen that the inequality (5.31) implies that there is a maxi- 
mum and a minimum possible value of m. The maximum value of m is denoted 
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I. What is the minimum value? Suppose that we try to lower the value of m 
below its minimum value m m i n . Since there is no state with m < m m i n , we must 
have 

L~|l,m mi n} = 0. (5.53) 
According to Eq. (5.35), this implies that 

q m . = 0. (5.54) 

It can be seen from Eq. (5.52) that m m i n = —I. We conclude that m can take a 
"ladder" of discrete values, each rung differing from its immediate neighbours by 
unity. The top rung is I, and the bottom rung is —I. There are only two possible 
choices for I. Either it is an integer (e.g., 1 = 2, which allows m to take the values 
— 2, — 1 , 0, 1 , 2), or it is a half-integer (e.g., I = 3/2, which allows m to take the 
values —3/2, —1/2, 1 /2,3/2). We will prove in the next section that an orbital 
angular momentum can only take integer values of I. 

In summary, using just the fundamental commutation relations (5.8)-(5.10), 
plus the fact that L x , L y , and L z are Hermitian operators, we have shown that the 
eigenvalues of L 2 = L x 2 + L y 2 + L z 2 can be written I (1+ 1 ) h 2 , where I is an integer, 
or a half-integer. We have also demonstrated that the eigenvalues of L z can only 
take the values m h, where m lies in the range —I, — I + 1 , • • • I — 1 , 1. Let |l, m) 
denote a properly normalized simultaneous eigenket of L 2 and L z , belonging to 
the eigenvalues I (I + 1 ) h 2 and m h, respectively. We have shown that 



L+|l,m) = J\(l+ 1) -m(m+ 1) 7i|l,m+ 1) (5.55) 



L _ |l,m) = VI (I + 1) -m(m- 1) Pi|l,m- 1), (5.56) 
where L ± = L x ± iL y are the so-called shift operators. 



5.3 Rotation operators 

Consider a particle described by the spherical polar coordinates (r, 9,(p). The 
classical momentum conjugate to the azimuthal angle (p is the z-component of 
angular momentum, L z . According to Sect. 3.5, in quantum mechanics we can 
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always adopt Schrodinger's representation, for which ket space is spanned by the 
simultaneous eigenkets of the position operators r, 0, and c{), and L z takes the 
form 

pi 

L z = -ih— . (5.57) 
3(p 

We can do this because there is nothing in Sect. 3.5 which specifies that we 
have to use Cartesian coordinates — the representation (3.74) works for any well- 
defined set of coordinates. 

Consider an operator R(Acp) which rotates the system an angle Acp about the z- 
axis. This operator is very similar to the operator D(Ax), introduced in Sect. 3.8, 
which translates the system a distance Ax along the x axis. We were able to 
demonstrate in Sect. 3.8 that 

p x = i?i lim P^" 1 (5.58) 

where p x is the linear momentum conjugate to x. There is nothing in our deriva- 
tion of this result which specifies that x has to be a Cartesian coordinate. Thus, 
the result should apply just as well to an angular coordinate. We conclude that 

U = ihlun Rl&V J-\ (5.59) 

According to Eq. (5.59), we can write 

R(5cp) = 1 -iL z 6(p/h (5.60) 

in the limit 6(p — > 0. In other words, the angular momentum operator L z can be 
used to rotate the system about the z-axis by an infinitesimal amount. We say 
that L z is the generator of rotations about the z-axis. The above equation implies 
that 

R(A<p)=lim(l-i^) N , (5.61) 

which reduces to 

R(Acp) = exp(-iL z Acp/Ti). (5.62) 
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Note that R(Acp) has all of the properties we would expect of a rotation operator 

R(0) = 1, (5.63) 
R(Acp)R(-Acp) = 1, (5.64) 
R(A(pi)R(A(p 2 ) = R(A(p] + A<p 2 ). (5.65) 

Suppose that the system is in a simultaneous eigenstate of L 2 and L z . As before, 
this state is represented by the eigenket |l,m), where the eigenvalue of L 2 is 
1(1+ 1 ) Pi 2 , and the eigenvalue of L z is m h. We expect the wave-function to 
remain unaltered if we rotate the system 2n degrees about the z-axis. Thus, 

R(27t)|l,m) = exp(— iL z 27t/Ti)|l,m) = exp(— i27tm)|l,m) = |l,m). (5.66) 

We conclude that m must be an integer. This implies, from the previous section, 
that I must also be an integer. Thus, orbital angular momentum can only take on 
integer values of the quantum numbers I and m. 

Consider the action of the rotation operator R(Acp) on an eigenstate possessing 
zero angular momentum about the z-axis (i.e., an m = state). We have 

R(A<p)|l,0) =exp(0)|l,0) = |l,0). (5.67) 

Thus, the eigenstate is invariant to rotations about the z-axis. Clearly, its wave- 
function must be symmetric about the z-axis. 

There is nothing special about the z-axis, so we can write 

R x (A(p x ) = exp(-iL x A(p x /h), (5.68) 
R y (A<p y ) = exp(-iL y A<p y /Fi), (5.69) 
R z (A(p y ) = exp(-iL z A(p z /ru), (5.70) 

by analogy with Eq. (5.62). Here, R x (A(p x ) denotes an operator which rotates the 
system by an angle A(p x about the x-axis, etc. Suppose that the system is in an 
eigenstate of zero overall orbital angular momentum (i.e., an I = state). We 
know that the system is also in an eigenstate of zero orbital angular momentum 
about any particular axis. This follows because 1 = implies m = 0, according 
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to the previous section, and we can choose the z-axis to point in any direction. 
Thus, 

R x (A<p x )|0,0) = exp(0)|0,0) = |0,0), (5.71) 
Ry(A<pJ0,0) = exp(0)|0,0) = |0,0), (5.72) 
R 2 (Acp z )|0,0) = exp(0)|0,0) = |0,0). (5.73) 

Clearly, a zero angular momentum state is invariant to rotations about any axis. 
Such a state must possess a spherically symmetric wave-function. 

Note that a rotation about the x-axis does not commute with a rotation about 
the u-axis. In other words, if the system is rotated an angle Acp x about the x-axis, 
and then Acp v about the u-axis, it ends up in a different state to that obtained 
by rotating an angle A(p y about the u-axis, and then A(p x about the x-axis. In 
quantum mechanics, this implies that R v {A(p y ) R x (A(p x ) ^ R x (A(p x ) R v [A<p y ), or 
L y L x 7^ L x L y , [see Eqs. (5.68)-(5.70)]. Thus, the noncommuting nature of the 
angular momentum operators is a direct consequence of the fact that rotations 
do not commute. 



5.4 Eigenfunctions of orbital angular momentum 

In Cartesian coordinates, the three components of orbital angular momentum can 
be written 



9 9 ^ 
dz dy) ' 



U = -ih(y^--z^-), (5.74) 



L v = -i^zi-xl), (5.75) 

u = - i?i ( x ^-"y^)> ^ 5 - 76 ^ 

using the Schrodinger representation. Transforming to standard spherical polar 
coordinates, 

x = rsin0cos(p, (5.77) 
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y = rsin0sin(p, (5.78) 

z = r cos 9, (5.79) 

we obtain 

(3 3 \ 

sin (p — + cot cos (p - I (5.80) 
30 3(p/ 

(3 3 \ 

cos <P 17 _ cot sin cp - I (5.81) 
30 3(p/ 

L z = -iTt— . (5.82) 
3(p 

Note that Eq. (5.82) accords with Eq. (5.57). The shift operators L ± = L x ± iL^ 
become 

— ±icot0— J. (5.83) 



Now, 



L 2 = L x 2 + L y 2 + L z 2 = L z 2 + (L + L + L L+)/2, (5.84) 

L 2 = - h 2 f JL 4 sin e 9 + _J* ) . (5.85) 
\sin0 30 30 sin 2 3(p 2 / 

The eigenvalue problem for L 2 takes the form 

L 2 ip = A rt 2 i|), (5.86) 
where ip(r, 0, cp) is the wave-function, and A is a number. Let us write 

i|>(r,9,<p) = R(r)Y(9,<p). (5.87) 

Equation (5.86) reduces to 

i ?\ ?\ i tO- \ 

sin0— + . 2^—! ] Y + A Y = 0, (5.88) 



so 



sin 30 30 sin 2 3(p 2 

where use has been made of Eq. (5.85). As is well-known, square integrable 
solutions to this equation only exist when A takes the values I (I + 1 ), where I is 
an integer. These solutions are known as spherical harmonics, and can be written 



YT(e, <p) = . 2l / 1 n_ L m ll H ) m e im? Pi m (cos cp), (5.89) 
^ An (l + mj! 
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where m is a positive integer lying in the range < m < I. Here, P™(£) is an 
associated Legendre function satisfying the equation 



d 



m 



d£, 



,2 



m 

P l m + l(l+1)P l m = 0. (5.90) 



d£, 
We define 

Yf m = (-1) m (Y L m )*, (5.91) 

which allows m to take the negative values — I < m < 0. The spherical harmonics 
are orthogonal functions, and are properly normalized with respect to integration 
over the entire solid angle: 

[ YT > (e,<p)YII l '(e,(p)sineded(p = 6 U /6 mm /. (5.92) 
o Jo 

The spherical harmonics also form a complete set for representing general func- 
tions of and cp. 

By definition, 

L 2 Y L m = 1(1+1) ru 2 Y L m , (5.93) 
where I is an integer. It follows from Eqs. (5.82) and (5.89) that 

L z Y l m = mruY l m , (5.94) 

where m is an integer lying in the range — I < m < I. Thus, the wave-function 
i|;(r, 0, (p) = R(r) Y[ a (0, where R is a general function, has all of the expected 
features of the wave-function of a simultaneous eigenstate of L 2 and L z belonging 
to the quantum numbers I and m. The well-known formula 



>m 



d^ VT^' 1 l-^ 2 1 

(l + m)(l-m+1) ! mj 

can be combined with Eqs. (5.83) and (5.89) to give 

L+Y L m = ^l(l+l)-m(m+l) PiY^ 1 , (5.96) 



VY™ = y/l (I + 1 ) - m (m - 1 ) FiY™ -1 . (5.97) 
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These equations are equivalent to Eqs. (5.55)-(5.56). Note that a spherical har- 
monic wave-function is symmetric about the z-axis (i.e., independent of (p) when- 
ever m = 0, and is spherically symmetric whenever 1 = (since Yg = 1 /V4n). 

In summary by solving directly for the eigenfunctions of L 2 and L z in Schrod- 
inger's representation, we have been able to reproduce all of the results of Sect. 5.2 
Nevertheless, the results of Sect. 5.2 are more general than those obtained in this 
section, because they still apply when the quantum number I takes on half- integer 
values. 



5.5 Motion in a central field 

Consider a particle of mass M moving in a spherically symmetric potential. The 
Hamiltonian takes the form 



H 



2M 



+ V(r). 



(5.98) 



Adopting Schrodinger's representation, we can write p = — (i/Ti)V. Hence, 

.2 



H 



2M 



V 2 + V(r). 



(5.99) 



When written in spherical polar coordinates, the above equation becomes 



H 



h 1 



2M 



i a 2 a 

-r— + 



1 3.^3 

sin9— + 



1 



r 2 3r 3r r 2 sin 30 30 r 2 sin 3(p 2 



+ V(t). (5.100) 



Comparing this equation with Eq. (5.85), we find that 



H 



ft 2 
2M 



1 3 



+ 



r 2 3r 3r h 2 r 2 



+ V(t). 



(5.101) 



Now, we know that the three components of angular momentum commute 
with L 2 (see Sect. 5.1). We also know, from Eqs. (5.80)-(5.82), that L x , L y , 
and L z take the form of partial derivative operators of the angular coordinates, 
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when written in terms of spherical polar coordinates using Schrodinger's repre- 
sentation. It follows from Eq. (5.101) that all three components of the angular 
momentum commute with the Hamiltonian: 

[L,H]=0. (5.102) 

It is also easily seen that L 2 commutes with the Hamiltonian: 

[L 2 ,H]=0. (5.103) 

According to Sect. 4.2, the previous two equations ensure that the angular mo- 
mentum L and its magnitude squared L 2 are both constants of the motion. This 
is as expected for a spherically symmetric potential. 

Consider the energy eigenvalue problem 

Hip = Eip, (5.104) 

where E is a number. Since E 2 and E z commute with each other and the Hamil- 
tonian, it is always possible to represent the state of the system in terms of the 
simultaneous eigenstates of E 2 , E z , and H. But, we already know that the most 
general form for the wave-function of a simultaneous eigenstate of E 2 and E z is 
(see previous section) 

iMr,e,cp) = R(r)Yr(e,cp). (5.105) 

Substituting Eq. (5.105) into Eq. (5.101), and making use of Eq. (5.93), we ob- 
tain 

2M [ r 2 dr r dr ' r 2 1 + (rJ 

This is a Sturm-Liouville equation for the function R(r). We know, from the gen- 
eral properties of this type of equation, that if R(r) is required to be well-behaved 
at r = and as r — > oo then solutions only exist for a discrete set of values of E. 
These are the energy eigenvalues. In general, the energy eigenvalues depend on 
the quantum number I, but are independent of the quantum number m. 



R = 0. (5.106) 
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5.6 Energy levels of the hydrogen atom 



Consider a hydrogen atom, for which the potential takes the specific form 

V(r)=-- e ^. (5.107) 

47T£o r 

The radial eigenfunction R(r) satisfies Eq. (5.106), which can be written 



h 2 



1 d 2 d 1(1 + 1) 



-r~ h 

r 2 dr dr 



47teor 



R = 0. 



(5.108) 



Here, \i = m e m p /(m e +m p ) is the reduced mass, which takes into account the fact 
that the electron (of mass m e ) and the proton (of mass m p ) both rotate about a 
common centre, which is equivalent to a particle of mass yi rotating about a fixed 
point. Let us write the product rR(r) as the function P(r). The above equation 
transforms to 



d 2 P _2\i /l(l+l)Ti" 
dr 2 " 



E P = 0, 



(5.109) 



2|j.r 2 Atxcqt 

which is the one-dimensional Schrodinger equation for a particle of mass u. mov- 
ing in the effective potential 

~ 2 l(l+l)?i 2 



Veff(T) 



+ 



(5.110) 



The effective potential has a simple physical interpretation. The first part is the 
attractive Coulomb potential, and the second part corresponds to the repulsive 
centrifugal force. 



Let 



and y —r/a, with 



a 



P(r)=f(u)exp(-y). 



(5.111) 



(5.112) 



Here, it is assumed that the energy eigenvalue E is negative. Equation (5.109) 
transforms to 



l(!±1) + ^Llf = 0. 



4ne Pi y 



(5.113) 
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Let us look for a power-law solution of the form 

n 

Substituting this solution into Eq. (5.113), we obtain 



(5.114) 



n-1 



^c n I n(n- 1)u n 2 -2ny 

n I 

Equating the coefficients of y n 2 gives 
c n [n (n — 1 ) — I (I + 1 )] = c n _] 



l(l + 1)u n - 2 + 



2(n-1) 



2 ^ 2fl n-1 

47re h 2 



2 \ie 2 a 
4ne h 2 



0. (5.115) 



(5.116) 



Now, the power law series (5.114) must terminate at small u, at some positive 
value of n, otherwise f(y) behaves unphysically as y — > 0. This is only possible if 
[n-min(n.min — 1 ) — I (1 + 1 )] =0, where the first term in the series is c nmin -y nmin . There 
are two possibilities: n m i n = —I or n m i n = I + 1 . The former predicts unphysical 
behaviour of the wave-function at y = 0. Thus, we conclude that n m i n = 1+1. 
Note that for an I = state there is a finite probability of finding the electron at 
the nucleus, whereas for an I > state there is zero probability of finding the 
electron at the nucleus (i.e., |ij>| 2 = at r = 0, except when 1 = 0). Note, also, 
that it is only possible to obtain sensible behaviour of the wave-function as r — > 
if I is an integer. 



For large values of y, the ratio of successive terms in the series (5.114) is 

c n V 2y 



Cn-1 



n 



(5.117) 



according to Eq. (5.116). This is the same as the ratio of successive terms in the 
series 

(2d)' 



n 



TL! 



(5.118) 



which converges to exp(2u). We conclude that f(y) — > exp(2u) as y — > oo. It 
follows from Eq. (5.112) that R(r) — > exp(r/a)/r as r — > oo. This does not cor- 
respond to physically acceptable behaviour of the wave-function, since J |\1»| 2 dV 
must be finite. The only way in which we can avoid this unphysical behaviour is 
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if the series (5.114) terminates at some maximum value of u. According to the 
recursion relation (5.116), this is only possible if 

h n, (5.119) 



4ne h 2 



where the last term in the series is c n y n . It follows from Eq. (5.111) that the 
energy eigenvalues are quantized, and can only take the values 

^ (5.120) 



327r 2 e 2 Pi 2 u 2 ' 



Here, n is a positive integer which must exceed the quantum number I, otherwise 
there would be no terms in the series (5.114). 

It is clear that the wave-function for a hydrogen atom can be written 

il)(T,e,<p)=R(T/a)YT(e,cp), (5.121) 

where 



a 



n 47ieo h 



\xe 2 



5.3 x 10 11 u meters, (5.122) 



and R(x) is a well-behaved solution of the differential equation 

1 d 2 d 1(1+1) 2n \ 

-77- *i ; — + 1 R = 0. (5.123) 

x z dx dx x l x ) 

Finally, the Y™ are spherical harmonics. The restrictions on the quantum numbers 
are |m| < I < n. Here, n is a positive integer, I is a non-negative integer, and m 
is an integer. 

The ground state of hydrogen corresponds to n = 1 . The only permissible 
values of the other quantum numbers are 1 = and m = 0. Thus, the ground 
state is a spherically symmetric, zero angular momentum state. The energy of 
the ground state is 

4 

' tL -13.6 electron volts. (5.124) 



32tt 2 e 2 h 2 
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The next energy level corresponds to n = 2. The other quantum numbers are 
allowed to take the values 1 = 0, m = 0orl = 1, m = — 1 , 0, 1 . Thus, there are 
n = 2 states with non-zero angular momentum. Note that the energy levels given 
in Eq. (5.120) are independent of the quantum number I, despite the fact that I 
appears in the radial eigenfunction equation (5.123). This is a special property 
of a 1 /r Coulomb potential. 

In addition to the quantized negative energy state of the hydrogen atom, which 
we have just found, there is also a continuum of unbound positive energy states. 

5.7 Spin angular momentum 

Up to now, we have tacitly assumed that the state of a particle in quantum me- 
chanics can be completely specified by giving the wave-function ip as a function 
of the spatial coordinates x, y, and z. Unfortunately, there is a wealth of experi- 
mental evidence which suggests that this simplistic approach is incomplete. 

Consider an isolated system at rest, and let the eigenvalue of its total angular 
momentum be j (j + 1 ) h 2 . According to the theory of orbital angular momentum 
outlined in Sects. 5.4 and 5.5, there are two possibilities. For a system consisting 
of a single particle, j = 0. For a system consisting of two (or more) particles, ) is 
a non-negative integer. However, this does not agree with observations, because 
we often find systems which appear to be structureless, and yet have ) ^ 0. Even 
worse, systems where j has half-integer values abound in nature. In order to 
explain this apparent discrepancy between theory and experiments, Gouldsmit 
and Uhlenbeck (in 1925) introduced the concept of an internal, purely quantum 
mechanical, angular momentum called spin. For a particle with spin, the total 
angular momentum in the rest frame is non-vanishing. 

Let us denote the three components of the spin angular momentum of a par- 
ticle by the Hermitian operators (S x , S y , S z ) = S. We assume that these operators 
obey the fundamental commutation relations (5.8)-(5.10) for the components of 
an angular momentum. Thus, we can write 

SxS = i?iS. (5.125) 
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We can also define the operator 



S S x 4- Sy -\- S z . 



(5.126) 



According to the quite general analysis of Sect. 5.1, 

[S,S 2 ]=0. 



(5.127) 



Thus, it is possible to find simultaneous eigenstates of S 2 and S z . These are 
denoted |s, s z ), where 



According to the equally general analysis of Sect. 5.2, the quantum number s can, 
in principle, take integer or half-integer values, and the quantum number s z can 
only take the values s,s — 1 s + 1 , — s. 

Spin angular momentum clearly has many properties in common with orbital 
angular momentum. However, there is one vitally important difference. Spin 
angular momentum operators cannot be expressed in terms of position and mo- 
mentum operators, like in Eqs. (5.1)-(5.3), since this identification depends on 
an analogy with classical mechanics, and the concept of spin is purely quantum 
mechanical: i.e., it has no analogy in classical physics. Consequently, the re- 
striction that the quantum number of the overall angular momentum must take 
integer values is lifted for spin angular momentum, since this restriction (found 
in Sects. 5.3 and 5.4) depends on Eqs. (5.1)-(5.3). In other words, the quantum 
number s is allowed to take half-integer values. 

Consider a spin one-half particle, for which 



Here, the |±) denote eigenkets of the S z operator corresponding to the eigen- 
values ±h/2. These kets are orthonormal (since S z is an Hermitian operator), 
so 



S z |s,s z ^ — s z h-js^^, 
S 2 |s,s z ) = s (s + 1) Ti 2 |s,s z ). 



(5.128) 



(5.129) 




(5.130) 



(5.131) 



<+!->= o. 



(5.132) 
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They are also properly normalized and complete, so that 

<+!+} = H-) = 1, (5.133) 

and 

l+)(+l + l-X-l = i- ( 5 - 134 ) 

It is easily verified that the Hermitian operators defined by 

S x = ^(l+X-l + l-X+l), (5-135) 
s y = y (-I+X-I + I-X+I), (5.136) 

S z = \ ( l+X+l - l-X-l ) , (5.137) 

satisfy the commutation relations (5.8)-(5.10) (with the Lj replaced by the Sj). 
The operator S 2 takes the form 

3 Ft 2 

S 2 = (5.138) 
4 

It is also easily demonstrated that S 2 and S z , defined in this manner, satisfy the 
eigenvalue relations (5.130)-(5.131). Equations (5.135)-(5.138) constitute a re- 
alization of the spin operators S and S 2 (for a spin one-half particle) in spin space 
(i.e., that Hilbert sub-space consisting of kets which correspond to the different 
spin states of the particle) . 



5.8 Wave-function of a spin one-half particle 

The state of a spin one-half particle is represented as a vector in ket space. 
Let us suppose that this space is spanned by the basis kets \x' y y' } z' } ±). Here, 
\x' y y' } z' } ±) denotes a simultaneous eigenstate of the position operators x, y, z, 
and the spin operator S z , corresponding to the eigenvalues x', y', z', and ±Ti/2, 
respectively. The basis kets are assumed to satisfy the completeness relation 

(|x / ,-y / ,z , ,+)(x / ,-y / ,z , ,+| + |x / ,-y / ,z / ,-)<x / ,-y / ,z , ,-|) dx'du'dz' = 1. 

(5.139) 
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It is helpful to think of the ket Ix^y', z',+) as the product of two kets — a 
position space ket \x' } y\z'), and a spin space ket |+). We assume that such a 
product obeys the commutative and distributive axioms of multiplication: 

|x , ,-y , ,z , }|+) = |+}|x , ,-y , ,z / ), (5.140) 

(cV ) u' ) z'} + c"|x" ) i J " ) z"))|+) = cV,-y , ,z , )l+) 

+c / V,-y ,, ,z"}|+) (5.141) 
|x / ,-y / ,z , )(c + |+)+c_|-)) = c + |x / ,-y / ,z , )|+) 

+c_|x , ,y , ,z , )R, (5.142) 

where the c's are numbers. We can give meaning to any position space operator 
(such as L z ) acting on the product |x' ,y' ', z')|+) by assuming that it operates only 
on the \x' y y' } z') factor, and commutes with the |+) factor. Similarly, we can give 
a meaning to any spin operator (such as S z ) acting on |x',y z')|+) by assuming 
that it operates only on |+), and commutes with |x', y ' , z') . This implies that every 
position space operator commutes with every spin operator. In this manner, we 
can give meaning to the equation 

|x , ,-y , ,z , ,±) = |x',y',z')|±) = \±)\x' ,y\ z'). (5.143) 

The multiplication in the above equation is of quite a different type to any 
which we have encountered previously. The ket vectors \x',y',z') and |±) are 
in two quite separate vector spaces, and their product \x' y y'> z')|±) is in a third 
vector space. In mathematics, the latter space is termed the product space of the 
former spaces, which are termed factor spaces. The number of dimensions of a 
product space is equal to the product of the number of dimensions of each of the 
factor spaces. A general ket of the product space is not of the form (5.143), but 
is instead a sum or integral of kets of this form. 

A general state A of a spin one-half particle is represented as a ket ||A)) in the 
product of the spin and position spaces. This state can be completely specified by 
two wavefunctions: 

iP + (x , ,-y , ,z / ) = (x'.y'.z'K+HA)), (5.144) 
iMx',y',z') = (x'.y'.z'K-HA)). (5.145) 
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The probability of observing the particle in the region x' to x' + dx', y ' to y ' + dy ', 
and z' to z' + dz' , with s z = +1/2 is \^> + {x',y', z')\ 2 dx'dy'dz'. Likewise, the 
probability of observing the particle in the region x' to x' + dx', y' to y ' + dy' , 
and z' to z' + dz' , with s z = —1 /2 is |ij> _(x',u', z')| 2 dx'dy'dz'. The normalization 
condition for the wavefunctions is 

(|ip + | 2 + |i|;_| 2 ) dx'dy'dz' = 1. (5.146) 
5.9 Rotation operators in spin space 

Let us, for the moment, forget about the spatial position of the particle, and 
concentrate on its spin state. A general spin state A is represented by the ket 

|A) = (+|A)l+) + <-|A)h) (5.147) 

in spin space. In Sect. 5.3, we were able to construct an operator R z (A(p) which 
rotates the system by an angle Acp about the z-axis in position space. Can we also 
construct an operator T z (A(p) which rotates the system by an angle A(p about 
the z-axis in spin space? By analogy with Eq. (5.62), we would expect such an 
operator to take the form 

T z (A<p) = exp(-iS z A(p/h). (5.148) 
Thus, after rotation, the ket |A) becomes 

|A R )=T z (A(p)|A). (5.149) 

To demonstrate that the operator (5.148) really does rotate the spin of the 
system, let us consider its effect on (S x ). Under rotation, this expectation value 
changes as follows: 

(S x ) -> (A R |S X |A R ) = (A|T z tS x T z |A). (5.150) 
Thus, we need to compute 

exp( iS z Acp/Ti) S x exp(— iS z A(p/rt). (5.151) 
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This can be achieved in two different ways. 



First, we can use the explicit formula for S x given in Eq. (5.135). We find that 
Eq. (5.151) becomes 



^exp( iS z Acp/h) ( |+}(-| + |-)(+| ) exp(-iS z Acp/Ti), 



or 



(« 



,iAcp/2 |_|_^_| e iA( P/ 2 _|_ g-iAcp/2 



■X+le 



-iAcp/2 



which reduces to 

S x cos Acp — S y sin Acp, 
where use has been made of Eqs. (5.135)-(5.137). 



(5.152) 

(5.153) 
(5.154) 



A second approach is to use the so called Baker-Hausdorff lemma. This takes 
the form 



exp(iG A) A exp(-iGA) = A + iA[G,A] + 

'i n A n 



2! 



[G,[G,A]]+ (5.155) 



/i n A n \ 



where G is a Hermitian operator, and A is a real parameter. The proof of this 
lemma is left as an exercise. Applying the Baker-Hausdorff lemma to Eq. (5.151), 
we obtain 

2 



i Acp 



Pi 



[S z , sj + 



[S z , [S z , SJ] + 



(5.156) 



which reduces to 



1-^ + 
2! 



Acp 3 

cp + 

^ 3! 



(5.157) 



or 



S x cos Acp — S y sin Acp, (5.158) 

where use has been made of Eq. (5.125). The second proof is more general than 
the first, since it only uses the fundamental commutation relation (5.125), and is, 
therefore, valid for systems with spin angular momentum higher than one-half. 
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For a spin one-half system, both methods imply that 

(S x ) -> (S x ) cos Acp - (S^) sinA(p (5.159) 

under the action of the rotation operator (5.148). It is straight-forward to show 
that 

(S v ) -> (Sy) cos Acp + (S x ) sin Acp. (5.160) 

Furthermore, 

<S Z > -> (S z ), (5.161) 

since S z commutes with the rotation operator. Equations (5.159)-(5.161) demon- 
strate that the operator (5.148) rotates the expectation value of S by an angle Acp 
about the z-axis. In fact, the expectation value of the spin operator behaves like 
a classical vector under rotation: 

(S k )^^Rki(Si), (5.162) 
i 

where the R^i are the elements of the conventional rotation matrix for the rotation 
in question. It is clear, from our second derivation of the result (5.159), that this 
property is not restricted to the spin operators of a spin one-half system. In fact, 
we have effectively demonstrated that 

(Jk)^^Rki(h), (5.163) 
i 

where the J k are the generators of rotation, satisfying the fundamental commuta- 
tion relation J x J = i h J, and the rotation operator about the kth axis is written 
R k (Acp) =exp(-iJ k Acp/h). 

Consider the effect of the rotation operator (5.148) on the state ket (5.147). 
It is easily seen that 

T z (Acp)|A) = e- iA(p/2 (+|A)|+) + e iA(p/2 (-|A)|-). (5.164) 

Consider a rotation by 2n radians. We find that 

|A> -> T z (27t)|A) = -|A). (5.165) 
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Note that a ket rotated by 2n radians differs from the original ket by a minus 
sign. In fact, a rotation by 4n radians is needed to transform a ket into itself. 
The minus sign does not affect the expectation value of S, since S is sandwiched 
between (A| and |A), both of which change sign. Nevertheless, the minus sign 
does give rise to observable consequences, as we shall see presently. 



5.10 Magnetic moments 

Consider a particle of charge q and velocity v performing a circular orbit of radius 
r in the x-y plane. The charge is equivalent to a current loop of radius r in the 
x-y plane carrying current I = q v/27tr. The magnetic moment \x of the loop is of 
magnitude nr 2 I and is directed along the z-axis. Thus, we can write 

q 

M- = 2 r x v ) (5.166) 

where r and v are the vector position and velocity of the particle, respectively. 
However, we know that p = v/m, where p is the vector momentum of the parti- 
cle, and m is its mass. We also know that L = r x p, where L is the orbital angular 
momentum. It follows that 

H=^L. (5.167) 
2m 

Using the usual analogy between classical and quantum mechanics, we expect the 
above relation to also hold between the quantum mechanical operators, fi and L, 
which represent magnetic moment and orbital angular momentum, respectively. 
This is indeed found to the the case. 

Does spin angular momentum also give rise to a contribution to the magnetic 
moment of a charged particle? The answer is "yes". I n f act > relativistic quantum 
mechanics actually predicts that a charged particle possessing spin should also 
possess a magnetic moment (this was first demonstrated by Dirac) . We can write 

^ = ^ L (L+gS), (5.168) 
2m 

where g is called the gyromagnetic ratio. For an electron this ratio is found to be 
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The factor 2 is correctly predicted by Dirac's relativistic theory of the electron. 
The small correction 1 /(2n 137), derived originally by Schwinger, is due to quan- 
tum field effects. We shall ignore this correction in the following, so 

H~-^(L + 2S) (5.170) 

2m e 

for an electron (here, e > 0) . 



5.11 Spin precession 

The Hamiltonian for an electron at rest in a z-directed magnetic field, B = B % is 

H = -n-B = [ —) SB = cuS z , (5.171) 
\m e / 

where 

cu = — . (5.172) 

m e 

According to Eq. (4.28), the time evolution operator for this system is 

T(t,0) = exp(-iHt/h) = exp(-iS z cu t/Pi). (5.173) 

It can be seen, by comparison with Eq. (5.148), that the time evolution operator 
is precisely the same as the rotation operator for spin, with Acp set equal to cu t. 
It is immediately clear that the Hamiltonian (5.171) causes the electron spin to 
precess about the z-axis with angular frequency cu. In fact, Eqs. (5.159)-(5.161) 
imply that 



(S x ) t = (S x ) t =ocoscut- (S y )t=osincut, (5.174) 
(Sy) t = (S^)t=ocoscut + (S x ) t =osincut, (5.175) 
(S z ) t = (S z ) t=0 . (5.176) 

The time evolution of the state ket is given by analogy with Eq. (5.164): 

|A,t> =e- [wt/2 {+\A ) 0)\+) + e iwt/2 {-\A,0)\-). (5.177) 
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Note that it takes time t = 4n/cv for the state ket to return to its original state. By 
contrast, it only takes times t = 2n/a) for the spin vector to point in its original 
direction. 

We now describe an experiment to detect the minus sign in Eq. (5.165). An 
almost monoenergetic beam of neutrons is split in two, sent along two different 
paths, A and B, and then recombined. Path A goes through a magnetic field free 
region. However, path B enters a small region where a static magnetic field is 
present. As a result, a neutron state ket going along path B acquires a phase-shift 
exp(=pi cu T/2) (the =p signs correspond to s z = ±1 /2 states). Here, T is the time 
spent in the magnetic field, and cu is the spin precession frequency 

cu = (5.178) 

This frequency is defined in an analogous manner to Eq. (5.172). The gyro- 
magnetic ratio for a neutron is found experimentally to be g n = —1.91. (The 
magnetic moment of a neutron is entirely a quantum field effect). When neu- 
trons from path A and path B meet they undergo interference. We expect the 
observed neutron intensity in the interference region to exhibit a cos(±cu T/2 + 6) 
variation, where 6 is the phase difference between paths A and B in the absence 
of a magnetic field. In experiments, the time of flight T through the magnetic 
field region is kept constant, while the field-strength B is varied. It follows that 
the change in magnetic field required to produce successive maxima is 

AB = (5.179) 

where I is the path-length through the magnetic field region, and X is the de 
Broglie wavelength over 2n of the neutrons. The above prediction has been ver- 
ified experimentally to within a fraction of a percent. This prediction depends 
crucially on the fact that it takes a 4n rotation to return a state ket to its original 
state. If it only took a 2n rotation then AB would be half of the value given above, 
which does not agree with the experimental data. 
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5.12 Pauli two-component formalism 

We have seen, in Sect. 5.4, that the eigenstates of orbital angular momentum 
can be conveniently represented as spherical harmonics. In this representation, 
the orbital angular momentum operators take the form of differential operators 
involving only angular coordinates. It is conventional to represent the eigenstates 
of spin angular momentum as column (or row) matrices. In this representation, 
the spin angular momentum operators take the form of matrices. 

The matrix representation of a spin one-half system was introduced by Pauli in 
1926. Recall, from Sect. 5.9, that a general spin ket can be expressed as a linear 
combination of the two eigenkets of S z belonging to the eigenvalues ± Fi/2. These 
are denoted |±). Let us represent these basis eigenkets as column matrices: 



The corresponding eigenbras are represented as row matrices: 



In this scheme, a general bra takes the form 

iA>r 



A) = (+|A)|+) + (-|A)|-> -> I ) + '^ I , (5.184) 



and a general ket becomes 



1+) -» (J)=x+, 



(+1 -> (1,0) =xt, 

(-1 -> (o,i) =xL 



(5.180) 
(5.181) 

(5.182) 
(5.183) 



<A| = (A|+)(+| + (A|-)(-| -> ((A|+), (A|-)). (5.185) 
The column matrix (5.184) is called a two-component spinor, and can be written 

X= ((HA>) = (c + ) =C+X+ + C - X -' (5 - 186) 
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where the c± are complex numbers. The row matrix (5.185) becomes 

X f = «A|+),(A|-)) = (c + *,c_*) = c + *xt + c_*xt- (5.187) 

Consider the ket obtained by the action of a spin operator on ket A: 

|A') =S k |A). (5.188) 



This ket is represented as 



|A'>-> [ ] = x'- (5-189) 



However, 



(+|A'> = (+|S k |+)(+|A) + <+|S k |-)<-|A) ) (5.190) 
(-|A'> = (-|S k |+}(+|A} + (-|S k |-)HA), (5.191) 



or 



(5.192) 



;+|A'}\ = /<+|S k |+) <+|S k |-)W(+|A) 
'-\A')) l(-|S k |+) (-|S k |->J \(-\A) 

It follows that we can represent the operator/ket relation (5.188) as the matrix 
relation 

where the a k are the matrices of the (±|S k |±) values divided by h/2. These 
matrices, which are called the Pauli matrices, can easily be evaluated using the 
explicit forms for the spin operators given in Eqs. (5.135)-(5.137). We find that 

a, = I ? 1 1 , (5/194) 



1 

- 

1 

1 

-1 



o: = ( °. * I , (5.195) 



0-3 = I ! ° 1 I ■ (5/196) 
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Here, 1, 2, and 3 refer to x, y, and z, respectively. Note that, in this scheme, we 
are effectively representing the spin operators in terms of the Pauli matrices: 



o k . (5.197) 



The expectation value of can be written in terms of spinors and the Pauli 
matrices: 

S k > = (A|S k |A) = 2j A l±><±l s kl±)<±IA) = x t a kX . (5.198) 

The fundamental commutation relation for angular momentum, Eq. (5.125), 
can be combined with (5.197) to give the following commutation relation for the 
Pauli matrices: 

o-xo- = 2io\ (5.199) 

It is easily seen that the matrices (5.194)-(5.196) actually satisfy these relations 
(i.e., o"i o"2 — o"2 o"i = 2i 03, plus all cyclic permutations). It is also easily seen that 
the Pauli matrices satisfy the anti-commutation relations 

{0-1,0-^ = 2 6^. (5.200) 

Let us examine how the Pauli scheme can be extended to take into account the 
position of a spin one-half particle. Recall, from Sect. 5.8, that we can represent 
a general basis ket as the product of basis kets in position space and spin space: 

|x',-u',z',±) = |x',y',z')|±) = |±)|x',u ', z'). (5.201) 

The ket corresponding to state A is denoted 1 1 A}), and resides in the product 

space of the position and spin ket spaces. State A is completely specified by the 
two wave-functions 

iMx , ,-y , ,z / ) = (x'.y'.z'K+UA)), (5.202) 

iMx , ,-y , ,z / ) = (x'.y'.z'K-IIA)}. (5.203) 
Consider the operator relation 

||A'» =S k ||A)). (5.204) 
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It is easily seen that 

(x',-y',z'|<+|A'» = (+|S k |+}<x^',z'|<+||A}> 

+ (+|S k |-)(x , ,-y , ,z , |(-||A)), (5.205) 

(x\y\z'\(-\A')) = (-ISkl+Xx'.-y'.z'K+HA)) 

+ (-|S k |-)(x , ,-y , ,z , |(-||A)}, (5.206) 

where use has been made of the fact that the spin operator S k commutes with 
the eigenbras (x' y y' } z'\. It is fairly obvious that we can represent the operator 
relation (5.204) as a matrix relation if we generalize our definition of a spinor by 
writing 

I|A»-(J:£!)=X> (5.207) 

and so on. The components of a spinor are now wave-functions, instead of com- 
plex numbers. In this scheme, the operator equation (5.204) becomes simply 

x' = Q)o- k x- (5-208) 

Consider the operator relation 

HA')) =p k ||A)>. (5.209) 
In the Schrodinger representation, we have 

(x',y',z'|<+|A'» = (x',-y',z'|p k <+IIA» 

= -irv/^x'.-y'.z'K+llA)), (5.210) 
ox k 

<x',y',z'IHA'» = (x\y\z'\ Vk (-\\A)) 

= -ih^{x' y y' y z'\{-\\A)) y (5.211) 
ox k 

where use has been made of Eq. (3.78). The above equation reduces to 



>i(r')\ Z-iTiaaMr')/9*k 



k 



ipl(r') ) \ -ihd^\)-{r')/dxl) ' 

102 



(5.212) 



5. 12 Pauli two-component formalism 



5 ANGULAR MOMENTUM 



Thus, the operator equation (5.209) can be written 

X' = PkX, 



(5.213) 



where 



(5.214) 



Here, I is the 2x2 unit matrix. In fact, any position operator (e.g., Pk or L\J 
is represented in the Pauli scheme as some differential operator of the position 
eigenvalues multiplied by the 2x2 unit matrix. 

What about combinations of position and spin operators? The most commonly 
occurring combination is a dot product: e.g., S L = (Ti/2)cr-L. Consider the 
hybrid operator er a, where a = (a x , a y , a z ) is some vector position operator. This 
quantity is represented as a 2 x 2 matrix: 



Since, in the Schrodinger representation, a general position operator takes the 
form of a differential operator in x' , y' , or z' , it is clear that the above quantity 
must be regarded as a matrix differential operator which acts on spinors of the 
general form (5.207). The important identity 



follows from the commutation and anti-commutation relations (5.199) and (5.200). 
Thus, 




(5.215) 



(cr-a) (cr-b) = a-b + icr-(a x b) 



(5.216) 



UiOj.OkH x[Oj,Ok]) Qj b k 



k 



k 




k 



a-b + icr-(a x b). 



(5.217) 



A general rotation operator in spin space is written 

T(A<t>) = exp (— iS-nAcp/Ti) 



(5.218) 
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by analogy with Eq. (5.148), where n is a unit vector pointing along the axis of 
rotation, and Acp is the angle of rotation. Here, n can be regarded as a trivial 
position operator. The rotation operator is represented 



exp (— iS-n Acp/Ti) — > exp (— iff-nA(p/2) 



(5.219) 



in the Pauli scheme. The term on the right-hand side of the above expression is 
the exponential of a matrix. This can easily be evaluated using the Taylor series 
for an exponential, plus the rules 

= 1 



(a-n) n = 1 for n even, (5.220) 

(a-n) n = (a-n) for n odd. (5.221) 

These rules follow trivially from the identity (5.216). Thus, we can write 



exp (— i a-n Acp/2) 



1 



(a-n) 2 /A(p\ 2 (a-n) 4 /Acp\ 



- 



+ 



4! 



(a-n) 



'A(p\ (a-n) 3 /A(p\ 



3! 



) 



1 



+ 



+ 



(5.222) 



(5.223) 



= cos(A(p/2) I — i sin(A(p/2) a-n. 

The explicit 2x2 form of this matrix is 

cos(A(p/2) — in z sin(A(p/2) (— in x — n v ) sin(A(p/2) 
(— in x + riy) sin(A(p/2) cos(A(p/2) + in z sin(A(p/2) 

Rotation matrices act on spinors in much the same manner as the corresponding 
rotation operators act on state kets. Thus, 

X' = exp(-ia-nA(p/2) X , (5.224) 

where x' denotes the spinor obtained after rotating the spinor x an angle Acp 
about the n-axis. The Pauli matrices remain unchanged under rotations. How- 
ever, the quantity x^°"kX is proportional to the expectation value of Sk [see 
Eq. (5.198)], so we would expect it to transform like a vector under rotation 
(see Sect. 5.9). In fact, we require 



(X f cikX)' = (X^o-icX' = Y. R ki (xW), 



(5.225) 
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where the Rki are the elements of a conventional rotation matrix. This is easily 
demonstrated, since 

/io"3A(p\ /— io"3A(p\ 
exp I — - — I &] exp I I = (j] cos A(p — cr2SinA(p (5.226) 

plus all cyclic permutations. The above expression is the 2x2 matrix analogue 
of (see Sect. 5.9) 

exp S x exp = cos A(p — S y sinAcp. (5.227) 

The previous two formulae can both be validated using the Baker-Hausdorff 
lemma, (5.156), which holds for Hermitian matrices, in addition to Hermitian 
operators. 



5.13 Spin greater than one-half systems 

In the absence of spin, the Hamiltonian can be written as some function of the 
position and momentum operators. Using the Schrodinger representation, in 
which p — > — i Pi V, the energy eigenvalue problem, 

H |E) =E|E), (5.228) 

can be transformed into a partial differential equation for the wave-function 
ip(r') = (r'|E). This function specifies the probability density for observing the 
particle at a given position, r\ In general, we find 

Hij)=Ei|), (5.229) 

where H is now a partial differential operator. The boundary conditions (for a 
bound state) are obtained from the normalization constraint 

|o|)| 2 dV=1. (5.230) 

This is all very familiar. However, we now know how to generalize this scheme 
to deal with a spin one-half particle. Instead of representing the state of the par- 
ticle by a single wave-function, we use two wave-functions. The first, i|) + (r / ), 
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specifies the probability density of observing the particle at position r' with spin 
angular momentum +h/2 in the z-direction. The second, ij> (r'), specifies the 
probability density of observing the particle at position v' with spin angular mo- 
mentum — Pi/2 in the z-direction. In the Pauli scheme, these wave-functions are 
combined into a spinor, x, which is simply the row vector of ij> + and In gen- 
eral, the Hamiltonian is a function of the position, momentum, and spin opera- 
tors. Adopting the Schrodinger representation, and the Pauli scheme, the energy 
eigenvalue problem reduces to 



where x is a spinor (i.e., a 1 x 2 matrix of wave-functions) and H is a 2 x 2 matrix 
partial differential operator [see Eq. (5.215)]. The above spinor equation can 
always be written out explicitly as two coupled partial differential equations for 
ip + and 

Suppose that the Hamiltonian has no dependence on the spin operators. In this 
case, the Hamiltonian is represented as diagonal 2x2 matrix partial differential 
operator in the Schrodinger/Pauli scheme [see Eq. (5.214)]. In other words, the 
partial differential equation for decouples from that for In fact, both 
equations have the same form, so there is only really one differential equation. 
In this situation, the most general solution to Eq. (5.231) can be written 



Here, ip(r') is determined by the solution of the differential equation, and the 
c± are arbitrary complex numbers. The physical significance of the above ex- 
pression is clear. The Hamiltonian determines the relative probabilities of finding 
the particle at various different positions, but the direction of its spin angular 
momentum remains undetermined. 

Suppose that the Hamiltonian depends only on the spin operators. In this 
case, the Hamiltonian is represented as a 2 x 2 matrix of complex numbers in 
the Schrodinger/Pauli scheme [see Eq. (5.197)], and the spinor eigenvalue equa- 
tion (5.231) reduces to a straight-forward matrix eigenvalue problem. The most 



Hx = E X 



(5.231) 




(5.232) 
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general solution can again be written 




) 



(5.233) 



Here, the ratio c + /c is determined by the matrix eigenvalue problem, and the 
wave-function "4>(r') is arbitrary. Clearly, the Hamiltonian determines the direc- 
tion of the particle's spin angular momentum, but leaves its position undeter- 
mined. 

In general, of course, the Hamiltonian is a function of both position and 
spin operators. In this case, it is not possible to decompose the spinor as in 
Eqs. (5.232) and (5.233). In other words, a general Hamiltonian causes the di- 
rection of the particle's spin angular momentum to vary with position in some 
specified manner. This can only be represented as a spinor involving different 
wave-functions, and 

But, what happens if we have a spin one or a spin three-halves particle? It 
turns out that we can generalize the Pauli two-component scheme in a fairly 
straight-forward manner. Consider a spin-s particle: i.e., a particle for which the 
eigenvalue of S 2 is s (s + 1 ) h 2 . Here, s is either an integer, or a half- integer. 
The eigenvalues of S z are written s z h, where s z is allowed to take the values 
s, s — 1 , • • • , — s + 1 , — s. In fact, there are 2 s + 1 distinct allowed values of s z . 
Not surprisingly, we can represent the state of the particle by 2 s + 1 different 
wave-functions, denoted i|) Sz (r'). Here, ip Sz (r') specifies the probability density 
for observing the particle at position r' with spin angular momentum s z h in the 
z-direction. More exactly, 



where ||A)) denotes a state ket in the product space of the position and spin op- 
erators. The state of the particle can be represented more succinctly by a spinor, 
X, which is simply the 2 s + 1 component row vector of the i\) Sz [r'). Thus, a spin 
one-half particle is represented by a two-component spinor, a spin one particle 
by a three-component spinor, a spin three-halves particle by a four-component 
spinor, and so on. 



= <r'l< 



s,s z ||A» 



(5.234) 
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In this extended Schrodinger/Pauli scheme, position space operators take the 
form of diagonal (2 s + 1 ) x (2 s + 1 ) matrix differential operators. Thus, we can 
represent the momentum operators as [see Eq. (5.214)] 

g 

p k ^-i?i — I, (5.235) 
ox k 

where I is the (2 s + 1 ) x (2 s + 1 ) unit matrix. We represent the spin operators as 

Sic — > s Picric, (5.236) 
where the (2 s + 1 ) x (2 s + 1 ) extended Pauli matrix cr k has elements 

(s,j|S k |s,l) 

°k ji = 1, • (5.237) 

s ri 

Here, j, I are integers, or half-integers, lying in the range — s to +s. But, how can 
we evaluate the brackets (s,j|S k |s,l) and, thereby, construct the extended Pauli 
matrices? In fact, it is trivial to construct the <r z matrix. By definition, 

S z |s,j) =j Fi|s,j). (5.238) 

Hence, 

( (s,j|S|s,l) = i 

s a s 

where use has been made of the ortho normality property of the |s,j). Thus, cr z 
is the suitably normalized diagonal matrix of the eigenvalues of S z . The matrix 
elements of cr x and cr y are most easily obtained by considering the shift operators, 

S ± = S x ±iS y . (5.240) 

We know, from Eqs. (5.55)-(5.56), that 



S + |sJ) = J s (s + 1)-j(j + 1) h|s,j + l), (5.241) 



S"|sJ) = Vs(s + 1)-j(j-1) h|s,j-l). (5.242) 
It follows from Eqs. (5.237), and (5.240)-(5.242), that 



v /s(s + l)-j(j-1)6 j)l+1 

I0~xjjl — ^ 

2s 
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S (S + 1 ) — j (j + 1 ) Sj.i-T 

2s 



s(s + 1)-j(j-1)5 j>l+1 
lis 



s(s + 1)-j(j + 1)5 j , l _ 1 
lis 



(5.243) 



(5.244) 



According to Eqs. (5.239) and (5.243)-(5.244), the Pauli matrices for a spin one- 
half (s = 1 /I) particle are 



0", 



ov 



1 

1 

-l 

1 

1 

-1 



(5.245) 
(5.246) 
(5.247) 



as we have seen previously. For a spin one (s = 1 ) particle, we find that 

(0 1 0\ 



1 



x/2 
1 

VI 



V 



V 1 



cr 7 



1 1 

1 

-i \ 

1 -i 
i 

/I \ 




0-1 



(5.248) 



(5.249) 



(5.250) 



In fact, we can now construct the Pauli matrices for a spin anything particle. 
This means that we can convert the general energy eigenvalue problem for a 
spin-s particle, where the Hamiltonian is some function of position and spin op- 
erators, into 2s + 1 coupled partial differential equations involving the 2s + 1 
wave-functions ip Sz (r / ). Unfortunately, such a system of equations is generally 
too complicated to solve exactly. 
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5.14 Addition of angular momentum 

Consider a hydrogen atom in an I = 1 state. The electron possesses orbital angu- 
lar momentum of magnitude h, and spin angular momentum of magnitude h/2. 
So, what is the total angular momentum of the system? 

In order to answer this question, we are going to have to learn how to add 
angular momentum operators. Let us consider the most general case. Suppose 
that we have two sets of angular momentum operators, Ji and J 2 . By definition, 
these operators are Hermitian, and obey the fundamental commutation relations 

Ji x Ji = i Ft J b (5.251) 
J 2 x J 2 = i hJ 2 . (5.252) 

We assume that the two groups of operators correspond to different degrees of 
freedom of the system, so that 

[Ju, Jzj] = 0, (5.253) 

where i, j stand for either x, y, or z. For instance, Ji could be an orbital angular 
momentum operator, and J 2 a spin angular momentum operator. Alternatively, Ji 
and J 2 could be the orbital angular momentum operators of two different parti- 
cles in a multi-particle system. We know, from the general properties of angular 
momentum, that the eigenvalues of } 2 and J 2 2 can be written ji (ji + 1) h 2 and 
)i Ih + 1 ) ft > respectively, where ji and j 2 are either integers, or half-integers. We 
also know that the eigenvalues of Ji z and J 2z take the form mi h and m 2 h, respec- 
tively, where mi and m 2 are numbers lying in the ranges j 1 , j 1 — 1 , • • • , — j 1 + 1 , — j 1 
and j 2 , j 2 - 1 , • • • , -j 2 + 1 , -j 2 , respectively. 

Let us define the total angular momentum operator 

J = Ji+J 2 . (5.254) 

Now J is an Hermitian operator, since it is the sum of Hermitian operators. Ac- 
cording to Eqs. (5.11) and (5.14), J satisfies the fundamental commutation rela- 
tion 

JxJ = ihJ. (5.255) 
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Thus, J possesses all of the expected properties of an angular momentum opera- 
tor. It follows that the eigenvalue of J 2 can be written j (j + 1 ) ft 2 , where j is an 
integer, or a half-integer. The eigenvalue of J z takes the form m ft, where m lies 
in the range j , j — 1 , • • • , — j + 1 , — j . At this stage, we do not know the relationship 
between the quantum numbers of the total angular momentum, j and m, and 
those of the individual angular momenta, ji, j 2 , mi, and m^. 

Now 

J 2 = Ji 2 + J2 2 + 2JtJ 2 . (5.256) 

We know that 

[J 2 ,Ju] = 0, (5.257) 
[J 2 2 ,J2i] = 0, (5.258) 

and also that all of the Ju operators commute with the hi operators. It follows 
from Eq. (5.256) that 

[J 2 ,J 1 2 ] = [J 2 ,J 2 2 ]=0. (5.259) 

This implies that the quantum numbers ji, )i, and ) can all be measured simulta- 
neously. In other words, we can know the magnitude of the total angular momen- 
tum together with the magnitudes of the component angular momenta. However, 
it is clear from Eq. (5.256) that 

[J 2 , JiJ ^ 0, (5.260) 
[J 2 ,hz] ^ 0. (5.261) 

This suggests that it is not possible to measure the quantum numbers mi and 
m.2 simultaneously with the quantum number j . Thus, we cannot determine the 
projections of the individual angular momenta along the z-axis at the same time 
as the magnitude of the total angular momentum. 

It is clear, from the preceding discussion, that we can form two alternate 
groups of mutually commuting operators. The first group is J^, J 2 2 , Ji z , and }i z . 
The second group is J 1 2 ,J 2 2 ,J 2 , and J z . These two groups of operators are in- 
compatible with one another. We can define simultaneous eigenkets of each 
operator group. The simultaneous eigenkets of Ji 2 , J 2 2 , Ji z , and J 2z are denoted 
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|jij2;mi,m 2 ), where 

Ji 2 |ji,j2;rrii,rri2) = h (ji + 1) ^ 2 Iji, j2;mi,m 2 >, (5.262) 

J2 2 b'i,i2;m 1 ,m 2 ) = j 2 (j 2 + 1) ft 2 |ji J 2 ;mi,m 2 ), (5.263) 

Ji z |ji,j2;m 1 ,m 2 ) = m] rtlj], j 2 ;m 1 ,m 2 ), (5.264) 

J2 z |ji,j2;m 1 ,m 2 ) = m 2 rtlj] J 2 ;m 1 ,m 2 ). (5.265) 

The simultaneous eigenkets of J^, J 2 2 , J 2 and J z are denoted |ji , j 2 ; j, m), where 

Ti 2 |ji , jz; j, rrx> = U (j] + 1) rt 2 |ji,j 2 ;j,m), (5.266) 

J 2 2 lii,i2;i,m) = j 2 (j 2 + 1) h 2 \h ,j 2 ;j,m), (5.267) 

J 2 lii,j2;j,m) = ) {) + 1) Tx 2 |j 7 ,j 2 ;j,m), (5.268) 

J z |ji,i2;j,m) = m rL|j 1 J 2 ;j,m). (5.269) 

Each set of eigenkets are complete, mutually orthogonal (for eigenkets corre- 
sponding to different sets of eigenvalues), and have unit norms. Since the op- 
erators ] } 2 and J 2 2 are common to both operator groups, we can assume that the 
quantum numbers ji and j 2 are known. In other words, we can always determine 
the magnitudes of the individual angular momenta. In addition, we can either 
know the quantum numbers mi and m 2 , or the quantum numbers ) and m, but 
we cannot know both pairs of quantum numbers at the same time. We can write 
a conventional completeness relation for both sets of eigenkets: 

Y Y j2;m-i,m 2 ) (jbfamT, m 2 | = 1, (5.270) 

2 

^^\h,h\h™)(h,h',h™\ = 1, (5.271) 



mi m.2 



m 



where the right-hand sides denote the identity operator in the ket space corre- 
sponding to states of given ji and j 2 . The summation is over all allowed values of 
mi, m 2 , j, and m. 

The operator group J^, J 2 2 , J 2 , and J z is incompatible with the group J^, J 2 2 , 
Ji z , and J 2z . This means that if the system is in a simultaneous eigenstate of the 
former group then, in general, it is not in an eigenstate of the latter. In other 
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words, if the quantum numbers ji, j 2 , j, and m are known with certainty, then a 
measurement of the quantum numbers mi and m.2 will give a range of possible 
values. We can use the completeness relation (5.270) to write 

Ijij2;j>m) = Y Y (ji , j 2 ; ttm , m 2 |ji , j 2 ; 3 , m) [ji , j 2 ; ttm , m 2 ) . (5.272) 

mi m2 

Thus, we can write the eigenkets of the first group of operators as a weighted 
sum of the eigenkets of the second set. The weights, (ji, j 2 ;mi,m 2 |ji, j 2 ;j,m), 
are called the Clebsch-Gordon coefficients. If the system is in a state where a 
measurement of Ji 2 , J 2 2 , J 2 , and J z is bound to give the results ji (ji + 1 ) ft 2 , j 2 (j 2 + 
1) ft 2 ,j (j + 1) ft 2 , and j z ft, respectively, then a measurement of Ji z and J 2z will 
give the results mi ft and m 2 ft with probability | (j i , j 2 ; mi , m 2 |j i , j 2 ; j , m) | 2 . 

The Clebsch-Gordon coefficients possess a number of very important proper- 
ties. First, the coefficients are zero unless 

m = mi + m 2 . (5.273) 

To prove this, we note that 

(Jz - Jiz - J 2 z)lji, j 2 ;j,m) = 0. (5.274) 

Forming the inner product with (j i , j 2 ; mi , m 2 |, we obtain 

(m-mi -m 2 )(ji,j 2 ;mi,m 2 |ji,j 2 ;j,m) = 0, (5.275) 

which proves the assertion. Thus, the z-components of different angular mo- 
menta add algebraically. So, an electron in an I = 1 state, with orbital angular 
momentum ft, and spin angular momentum Pi/2, projected along the z-axis, con- 
stitutes a state whose total angular momentum projected along the z-axis is 3 ft/2. 
What is uncertain is the magnitude of the total angular momentum. 

Second, the coefficients vanish unless 

\h-h\ < ) < ji+32. (5.276) 

We can assume, without loss of generality, that j i > j 2 . We know, from Eq. (5.273), 
that for given ji and j 2 the largest possible value of m is ji + j 2 (since ji is the 
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largest possible value of mi, etc.). This implies that the largest possible value of j 
is j i + )i (since, by definition, the largest value of m is equal to j). Now, there are 
(2 ji + 1 ) allowable values of mi and (2 j 2 + 1 ) allowable values of m 2 . Thus, there 
are (2 ji + 1 ) (2 j 2 + 1 ) independent eigenkets, |ji, j2;Tni,m2), needed to span the 
ket space corresponding to fixed ji and j 2 . Since the eigenkets j 2 ;j>in) span 
the same space, they must also form a set of (2 ji + 1 ) (2 )i + 1 ) independent kets. 
In other words, there can only be (2 ji + 1 ) (2 j 2 + 1 ) distinct allowable values of 
the quantum numbers j and m. For each allowed value of j, there are 2j + 1 
allowed values of m. We have already seen that the maximum allowed value of 
j is ji + )i. It is easily seen that if the minimum allowed value of j is ji — )i then 
the total number of allowed values of j and m is (2 j i + 1 ) (2)2 + 1): i.e., 

(2j + 1) = (2j 1 + 1)(2j 2 + 1). (5.277) 

Hi -3*2 

This proves our assertion. 

Third, the sum of the modulus squared of all of the Clebsch-Gordon coeffi- 
cients is unity: i.e., 

Y Y I(ji,j2;mi,m 2 |ji, j 2 ;j,m)| 2 = 1. (5.278) 

mi m.2 

This assertion is proved as follows: 

(ji,j2;j,m|ji,j 2 ;j,m) = 

mi m.2 

= HHlOi J2;mi,m 2 |ji,j2;j,m)| 2 = 1, (5.279) 

mi m.2 

where use has been made of the completeness relation (5.270). 

Finally, the Clebsch-Gordon coefficients obey two recursion relations. To ob- 
tain these relations we start from 

J^iJi-J.m) = (Jf + Jj) (5.280) 

x ^2^2<ji,j2;m 1 / ) m 2 |ji ) j2;j,m)|ji ) j 2 ;m 1 / ,m2>. 

mj m 2 
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Making use of the well-known properties of the shift operators, which are speci- 
fied by Eqs. (5.55)-(5.56), we obtain 

yji (i + 1) -m(m± 1) |ji J 2 ;j,m± 1) = 
Y_ Y_ (VMii + lj-mj (mj ±1) j 2 ;mi ± l,m^> 

+y/h ()2 + 1) -m^(m^± 1) |j 1 ,j 2 ;m 1 / ,m2 ±1)) 

X (j 1 , h\ TTlJ , TTL^Ij 1 , )i\ ) , m) . (5.281) 

Taking the inner product with (j i , j 2 ; mi , m 2 |, and making use of the orthonormal- 
ity property of the basis eigenkets, we obtain the desired recursion relations: 

y/j (j + 1) -m(m± 1) (ji,j 2 ;m l ,m 2 |j 1 J 2 ;j,m±1) = 

y/h (ji + 1) - mi (mi t 1) (h J2;m! T l,m 2 |j 1 ,j 2 ;j,m) 

+V)2(j2 + 1) -m 2 (m 2 =F 1) (iij2;mi,m 2 =p j2;j»m>. (5.282) 

It is clear, from the absence of complex coupling coefficients in the above re- 
lations, that we can always choose the Clebsch-Gordon coefficients to be real 
numbers. This is a convenient choice, since it ensures that the inverse Clebsch- 
Gordon coefficients, (ji, j 2 ;j,m|ji, j 2 ;mi,m 2 ), are identical to the Clebsch-Gordon 
coefficients. In other words, 

(ji, j2;j,m|j 1 , j 2 ;m 1 ,m 2 ) = (ji, j 2 ;m 1 ,m 2 |j 1 , j 2 ;j,m). (5.283) 

The inverse Clebsch-Gordon coefficients are the weights in the expansion of the 
|j i > j2"> ttli > m 2 ) in terms of the |ji ,j 2 ;j, m): 

|j i , j 2 ; rm , m 2 > = Y_ Y. Q 1 ' ^ ' ' m ^ 1 ' ^ mi ' m2 ^ ^ 1 ' ^ (5.284) 

j m 

It turns out that the recursion relations (5.282), together with the normal- 
ization condition (5.278), are sufficient to completely determine the Clebsch- 
Gordon coefficients to within an arbitrary sign (multiplied into all of the coeffi- 
cients). This sign is fixed by convention. The easiest way of demonstrating this 
assertion is by considering some specific examples. 
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Let us add the angular momentum of two spin one-half systems: e.g., two 
electrons at rest. So, ji = j 2 = 1/2. We know, from general principles, that 
|mi| < 1/2 and |m 2 | < 1/2. We also know, from Eq. (5.276), that < j < 1, 
where the allowed values of j differ by integer amounts. It follows that either 
j = or j = 1 . Thus, two spin one-half systems can be combined to form either a 
spin zero system or a spin one system. It is helpful to arrange all of the possibly 
non-zero Clebsch-Gordon coefficients in a table: 



mi 


TTL2 










1/2 


1/2 


? 


? 


? 


? 


1/2 


-1/2 


? 


? 


? 


? 


-1/2 


1/2 


? 


? 


? 


? 


-1/2 


-1/2 


? 


? 


? 


? 




j 


1 


1 


1 







m 


1 





-1 






The box in this table corresponding to mi = 1/2, m 2 = 1 /2, j = 1 , m = 1 
gives the Clebsch-Gordon coefficient (1 /2, 1 /2; 1 /2, 1 /2|1 /2, 1 /2; 1,1), or the in- 
verse Clebsch-Gordon coefficient (1/2, 1/2; 1 , 1 11/2, 1/2; 1/2, 1 /2). All the boxes 
contain question marks because we do not know any Clebsch-Gordon coefficients 
at the moment. 

A Clebsch-Gordon coefficient is automatically zero unless mi + m 2 = m. In 
other words, the z-components of angular momentum have to add algebraically. 
Many of the boxes in the above table correspond to mi +m 2 ^ m. We immediately 
conclude that these boxes must contain zeroes: i.e., 



mi 


m 2 










1/2 


1/2 


? 











1/2 


-1/2 





? 





? 


-1/2 


1/2 





? 





? 


-1/2 


-1/2 








? 





ji=1/2 


j 


1 


1 


1 





i2=i/2 


m 


1 





-1 






The normalization condition (5.278) implies that the sum of the squares of all 
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the rows and columns of the above table must be unity. There are two rows and 
two columns which only contain a single non-zero entry. We conclude that these 
entries must be ±1, but we have no way of determining the signs at present. 
Thus, 



mi 


m,2 










1/2 


1/2 


±1 











1/2 


-1/2 





? 





? 


-1/2 


1/2 





? 





? 


-1/2 


-1/2 








±1 





j i=V2 


) 


1 


1 


1 





32=1/2 


m 


1 





-1 






Let us evaluate the recursion relation (5.282) for ji = )i = 1/2, with j = 1 
m = 0, mi = TTi2 = ±1/2, taking the upper/lower sign. We find that 



;i /2, -1 /2| 1,0) + (-1 /2, 1 /2| 1 , 0) = v 7 ! (1 /2, 1 /2| 1 , 1 ) = ±\/2, (5.285) 



and 



(1/2,-1/2|1,0) + (-1/2,1/2|1,0) = v 7 ! (-1/2,-1/2|1, -1) = ±v / 2. (5.286) 

Here, the ji and j2 labels have been suppressed for ease of notation. We also 
know that 

(1/2, -1/2|1 , 0) 2 + (-1/2, 1/2|1 , 0) 2 = 1 , (5.287) 

from the normalization condition. The only real solutions to the above set of 
equations are 

Vl (1/2,-1/211,0) = ^(-1/2,1/211,0) 

= (1/2,1/2|1,1) = (1/2,1/2|1,-1) =±1. (5.288) 

The choice of sign is arbitrary — the conventional choice is a positive sign. Thus, 
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our table now reads 



mi 


TTL2 










1/2 


1/2 


1 











1/2 


-1/2 










? 


-1/2 


1/2 





1/V2 





? 


-1/2 


-1/2 








1 





ji=1/2 


j 


1 


1 


1 





>2 = 1/2 


m 


1 





-1 






We could fill in the remaining unknown entries of our table by using the re- 
cursion relation again. However, an easier method is to observe that the rows 
and columns of the table must all be mutually orthogonal. That is, the dot prod- 
uct of a row with any other row must be zero. Likewise, for the dot product of 
a column with any other column. This follows because the entries in the table 
give the expansion coefficients of one of our alternative sets of eigenkets in terms 
of the other set, and each set of eigenkets contains mutually orthogonal vectors 
with unit norms. The normalization condition tells us that the dot product of a 
row or column with itself must be unity. The only way that the dot product of the 
fourth column with the second column can be zero is if the unknown entries are 
equal and opposite. The requirement that the dot product of the fourth column 
with itself is unity tells us that the magnitudes of the unknown entries have to 
be 1 /y/l. The unknown entries are undetermined to an arbitrary sign multiplied 
into them both. Thus, the final form of our table (with the conventional choice 
of arbitrary signs) is 



mi 


m.2 










1/2 


1/2 


1 











1/2 


-1/2 





1/V2 





1A/2 


-1/2 


1/2 





^/V2 





-1A/2 


-1/2 


-1/2 








1 





)i=1/2 


) 


1 


i 


1 





32=1/2 


m 


1 





-1 






The table can be read in one of two ways. The columns give the expansions of 
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the eigenstates of overall angular momentum in terms of the eigenstates of the 
individual angular momenta of the two component systems. Thus, the second 
column tells us that 

|1 ,0) = -±= ( 1 1/2, -1/2) + I - 1/2, 1/2) ) . (5.289) 

The ket on the left-hand side is a |j, m) ket, whereas those on the right-hand side 
are |mi,m2) kets. The rows give the expansions of the eigenstates of individual 
angular momentum in terms of those of overall angular momentum. Thus, the 
second row tells us that 

|1/2,-1/2) = -L( |1,0) + |0,0)). (5.290) 

Here, the ket on the left-hand side is a |mi , m.2) ket, whereas those on the right- 
hand side are |j, m) kets. 

Note that our table is really a combination of two sub-tables, one involving 
j = states, and one involving j = 1 states. The Clebsch-Gordon coefficients cor- 
responding to two different choices of j are completely independent: i.e., there is 
no recursion relation linking Clebsch-Gordon coefficients corresponding to differ- 
ent values of j. Thus, for every choice of ji, j 2 , and j we can construct a table of 
Clebsch-Gordon coefficients corresponding to the different allowed values of mi , 
m.2, and m (subject to the constraint that mi + m.2 = m) . A complete knowledge 
of angular momentum addition is equivalent to a knowing all possible tables of 
Clebsch-Gordon coefficients. These tables are listed (for moderate values of ji, )i 
and j) in many standard reference books. 
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6.1 Introduction 

We have developed techniques by which the general energy eigenvalue problem 
can be reduced to a set of coupled partial differential equations involving various 
wave-functions. Unfortunately the number of such problems which yield exactly 
soluble equations is comparatively small. Clearly we need to develop some tech- 
niques for finding approximate solutions to otherwise intractable problems. 

Consider the following problem, which is very common. The Hamiltonian of a 
system is written 

H^Ho + Hl (6.1) 

Here, Ho is a simple Hamiltonian for which we know the exact eigenvalues and 
eigenstates. Hi introduces some interesting additional physics into the problem, 
but it is sufficiently complicated that when we add it to Ho we can no longer 
find the exact energy eigenvalues and eigenstates. However, Hi can, in some 
sense (which we shall specify more exactly later on), be regarded as being small 
compared to Ho. Can we find the approximate eigenvalues and eigenstates of the 
modified Hamiltonian, Ho + Hi, by performing some sort of perturbation analysis 
about the eigenvalues and eigenstates of the original Hamiltonian, Ho? Let us 
investigate. 

6.2 The two-state system 

Let us begin by considering time-independent perturbation theory, in which the 
modification to the Hamiltonian, Hi, has no explicit dependence on time. It is 
usually assumed that the unperturbed Hamiltonian, Ho, is also time-independent. 

Consider the simplest non-trivial system, in which there are only two indepen- 
dent eigenkets of the unperturbed Hamiltonian. These are denoted 

H |1> = Ei|l), (6.2) 
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Ho |2) = E 2 |2). (6.3) 

It is assumed that these states, and their associated eigenvalues, are known. Since 
Ho is, by definition, an Hermitian operator, its two eigenkets are orthonormal and 
form a complete set. The lengths of these eigenkets are both normalized to unity. 
Let us now try to solve the modified energy eigenvalue problem 

(Ho + Hi)|E) = E |E). (6.4) 

In fact, we can solve this problem exactly. Since, the eigenkets of Ho form a 
complete set, we can write 

1|E)|1) + (2|E)|2). (6.5) 



Right-multiplication of Eq. (6.4) by (1| and (2| yields two coupled equations, 
which can be written in matrix form: 



-E + en e 12 W(1|E)\ (0 
e{ 2 E 2 -E + e 22 J I (2|E) J lo 



(6.6) 



Here, 



en = (HHtH), (6.7) 
e 22 = (2|H!|2), (6.8) 
e 12 = (11^12). (6.9) 

In the special (but common) case of a perturbing Hamiltonian whose diagonal 
matrix elements (in the unperturbed eigenstates) are zero, so that 

en=e 22 = 0, (6.10) 

the solution of Eq. (6.6) (obtained by setting the determinant of the matrix equal 
to zero) is 



2 

Let us expand in the supposedly small parameter 

|ei 2 | 



(Ei + E 2 ) ± V(Ei - E 2 ) 2 + 4|e 12 | 2 

. (6.11) 



|Ei-E 2 | 
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We obtain 

E ~ 1(E! + E 2 ) ± l(Ei - E 2 )(l + 2 e 2 + • • •)• (6.13) 

The above expression yields the modifications to the energy eigenvalues due to 
the perturbing Hamiltonian: 

E; = E 1 + J 6l2 ^ + ---, (6.14) 
ti — t 2 

E 2 = E 2 -^ ei2 ^ + ---. (6.15) 
ti — t 2 

Note that Hi causes the upper eigenvalue to rise, and the lower eigenvalue to 
fall. It is easily demonstrated that the modified eigenkets take the form 

11) ' = n> + ^ e V|2) + --- ) (6.16) 

ti — t 2 

12) ' = |2)--^-|1) + --.. (6.17) 

ti — t 2 

Thus, the modified energy eigenstates consist of one of the unperturbed eigen- 
states with a slight admixture of the other. Note that the series expansion in 
Eq. (6.13) only converges if 2\e\ < 1. This suggests that the condition for the 
validity of the perturbation expansion is 

led < |El ~ E2 ' . (6.18) 

In other words, when we say that Hi needs to be small compared to Ho, what we 
really mean is that the above inequality needs to be satisfied. 



6.3 Non-degenerate perturbation theory 

Let us now generalize our perturbation analysis to deal with systems possessing 
more than two energy eigenstates. The energy eigenstates of the unperturbed 
Hamiltonian, Ho, are denoted 

Ho |n) = En |n), (6.19) 
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where n runs from 1 to N . The eigenkets |n) are orthonormal, form a complete 
set, and have their lengths normalized to unity. Let us now try to solve the energy 
eigenvalue problem for the perturbed Hamiltonian: 

(Ho + H^IE) =E|E). (6.20) 

We can express |E) as a linear superposition of the unperturbed energy eigenkets, 

|E) = ^Jk|E)|k), (6.21) 

k 

where the summation is from k = 1 to N. Substituting the above equation into 
Eq. (6.20), and right-multiplying by (m|, we obtain 

(E m + e mm - E) (m|E) + emk(k|E) = 0, (6.22) 

k^m. 

where 

e mk = (m|Hi|k>. (6.23) 

Let us now develop our perturbation expansion. We assume that 

0(e), (6.24) 
for all m/k, where e <C 1 is our expansion parameter. We also assume that 

^~0(e), (6.25) 



-m 



for all m. Let us search for a modified version of the nth unperturbed energy 
eigenstate, for which 

E = E n + 0(e), (6.26) 

and 

(n|E> = 1, (6.27) 
(m|E) ~ 0(e), (6.28) 

for m 7^ n. Suppose that we write out Eq. (6.22) for m ^ n, neglecting terms 
which are 0(e 2 ) according to our expansion scheme. We find that 

(E m - En) (m|E) + e mn ~ 0, (6.29) 
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giving 

(m|E) ~ - v ^ r . (6.30) 

Substituting the above expression into Eq. (6.22), evaluated for m = n, and 
neglecting 0(e 3 ) terms, we obtain 

(En + e nn - E) - Y_ -JH^- = 0. (6.31) 

Thus, the modified nth energy eigenstate possesses an eigenvalue 

E; = E n + e nn + V -}^- + 0(e 3 ), (6.32) 
feu En ~ Ek 

and a eigenket 

|n> 7 = |n) + V" — ^— He) + 0(e 2 ). (6.33) 
fen E ^ ~ Ek 

Note that 

(m|n)' = 6 mn + + + 0(e 2 ) = 6 mn + 0(e 2 ). (6.34) 

t-m '-u t-n tra 

Thus, the modified eigenkets remain orthonormal and properly normalized to 

0(e 2 ). 



6.4 The quadratic Stark effect 

Suppose that a one-electron atom [i.e., either a hydrogen atom, or an alkali metal 
atom (which possesses one valance electron orbiting outside a closed, spher- 
ically symmetric shell)] is subjected to a uniform electric field in the positive 
z-direction. The Hamiltonian of the system can be split into two parts. The un- 
perturbed Hamiltonian, 

H = ^- + V(r), (6.35) 
2m e 

and the perturbing Hamiltonian, 

Hi=e|E|z. (6.36) 
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It is assumed that the unperturbed energy eigenvalues and eigenstates are 
completely known. The electron spin is irrelevant in this problem (since the spin 
operators all commute with HO, so we can ignore the spin degrees of freedom of 
the system. This implies that the system possesses no degenerate energy eigen- 
values. This is not true for the n ^ 1 energy levels of the hydrogen atom, due to 
the special properties of a pure Coulomb potential. It is necessary to deal with 
this case separately, because the perturbation theory presented in Sect. 6.3 breaks 
down for degenerate unperturbed energy levels. 

An energy eigenket of the unperturbed Hamiltonian is characterized by three 
quantum numbers — the radial quantum number n, and the two angular quantum 
numbers I and m (see Sect. 5.6). Let us denote such a ket |u, l,m), and let its 
energy level be E nlm . According to Eq. (6.32), the change in this energy level 
induced by a small electric field is given by 

AE nlm = e |E| (n, I, m|z|n, I, m) 

(n, ^mlzln/l',™,'^ 2 



Now, since 
it follows that 
Thus, 
giving 



+e 2 |E|2 y L > IIL /' . (6.37) 



,i',l',m^n,l,m ~ nlm E ^' 1 '™' 



E z = xpy--yp x , (6.38) 
[L z ,z]=0. (6.39) 
(n } l } m\[U,z\\n' } l',m') = 0, (6.40) 



(m- m'){n } l } m\z\n' ,1' } m') = 0, (6.41) 

since |n, I, m) is, by definition, an eigenstate of L z with eigenvalue m h. It is 
clear, from the above relation, that the matrix element (n, I, mlzlu', V, m') is zero 
unless m' = m. This is termed the selection rule for the quantum number m. 

Let us now determine the selection rule for I. We have 

[L 2 ,z] = [L x 2 ,z] + [L 2 z] 
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= L x [L x ,z] + [L X ,Z]L X +Ly[Ly,Z] + [L y ,z]L y 

= i?i(-L x u -y L x + L y x + xL y ) 
= 2i Ti (L^ x — L x u + i Piz) 

= 2ift(L y x--yL x ) = 2iTi(xL y -L x -y), (6.42) 

where use has been made of Eqs. (5.1)-(5.6). Similarly, 

[L 2 ,-y] = 2iTi(L x z-xL 2 ), (6.43) 

[L 2 ,x] = 2iri(uL z -L y z). (6.44) 

Thus, 

[L 2 ,[L 2 ,z]] = 2ih(L 2 ,L y x-L x u + iFiz) 

= 2 i Pi (L v [L 2 , x] - L X [L 2 , y] + i Pi [L 2 , z]) , 
= -4h 2 L y (uL z -L y z)+4h 2 L x (L x z-xL z ) 

-2h 2 (L 2 z-zL 2 ). (6.45) 

This reduces to 

[L 2 ,[L 2 ,z]] = -h 2 [4(L x x + L y -y + L z z)L z -4(L x 2 + L v 2 + L z 2 )z 

+2(L 2 z-zL 2 )] . (6.46) 

However, it is clear from Eqs. (5.1)-(5.3) that 

L x x + L y -y + L z z = 0. (6.47) 

Hence, we obtain 

[L 2 ,[L 2 ,z]] =2h 2 (L 2 z + zL 2 ). (6.48) 
Finally, the above expression expands to give 

L 4 z-2L 2 zL 2 + zL 4 -2 Pi 2 (L 2 z + zL 2 ) =0. (6.49) 

Equation (6.49) implies that 

(u, I, m|L 4 z — 2 L 2 z L 2 + z L 4 — 2 Ft 2 (L 2 z + z L 2 ) |u', I', m'} = 0. (6.50) 
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This expression yields 

[l 2 (l + 1) 2 -2l(l+1)l'(l' + 1) + l' 2 (l' + 1) 2 
-21(1+1)- 21' (l / + 1)](n,l,m|z|n / ,l / ,m / ) = 0, (6.51) 

which reduces to 

(I + 1' + 2) (I + 1') (I - 1' + 1 ) (I - 1' - 1 ) (n, I, m|z|n', I', m'} = 0. (6.52) 

According to the above formula, the matrix element (n, I, m|z|n', I', m/) vanishes 
unless I = I' = or I' = I ± 1 . This matrix element can be written 



(n, I, mlzln', I', m') 



CUr', 9', cp') r'cos e'llw^r', 9', cp') dV, (6.53) 



where ipniml 1 "') — ( r/ l n > l> m )- Recall, however, that the wave-function of an I = 
state is spherically symmetric (see Sect. 5.3): i.e., ij>noo(i") = iKooIt')- It follows 
from Eq. (6.53) that the matrix element vanishes by symmetry when I = I' = 0. 
In conclusion, the matrix element (n, I, m|z|n', I', m') is zero unless I' = l± 1. 
This is the selection rule for the quantum number I. 

Application of the selection rules to Eq. (6.37) yields 

AE nlm = S |Ef Y. Z l<n F l,m|Z l n '' V ' m> ' 2 - (6-54) 

n' l'=l±1 nlm Ln ' l ' m 

Note that all of the terms in Eq. (6.37) which vary linearly with the electric field- 
strength vanish by symmetry, according to the selection rules. Only those terms 
which vary quadmtically with the field-strength survive. The polarizability of an 
atom is defined in terms of the energy-shift of the atomic state as follows: 

AE = --cc|E| 2 . (6.55) 

Consider the ground state of a hydrogen atom. (Recall, that we cannot address 
the n > 1 excited states because they are degenerate, and our theory cannot 
handle this at present) . The polarizability of this state is given by 

|(1,0,0Nn,1,0)| 2 r ,„ 
cx = 2 e } — - - —. (6.56) 

~[ E n0 o — E 10 o 
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Here, we have made use of the fact that E n io = E n oo for a hydrogen atom. 

The sum in the above expression can be evaluated approximately by noting 
that [see Eq. (5.120)] 

e 2 

EnOO = — o 7 (6.57) 

87ieo cio n z 

for a hydrogen atom, where 

a = 47r£ o ft = 5 3 x 10 -n meters (6.58) 

is the Bohr radius. We can write 

3 e 2 

E n oo ~~ ^100 > ^200 — ^100 = ~7 t> • (6.59) 

487teo ao 

Thus, 

oc< ^47X60 a ^ 1(1,0, 0|z|u,1,0)| 2 . (6.60) 

3 n>1 

However, 

^|(1,0,0|z|n,1,0}| 2 = Y_ (1,0,0|z|n / ,l / ,m / )(n / ,m / ,l / |z|l,0,0) 

n>1 u',l',m.' 

= (1,0,0|z 2 |1,0,0>, (6.61) 

where we have made use of the fact that the wave-functions of a hydrogen atom 
form a complete set. It is easily demonstrated from the actual form of the ground 
state wave-function that 

(1,0,0|z 2 |1,0,0> = a 2 . (6.62) 

Thus, we conclude that 

oc < 47te a 3 ~ 5.347te a 3 . (6.63) 

The true result is 

oc = - 4ne a,Q = 4.5 47te a . (6.64) 

It is actually possible to obtain this answer, without recourse to perturbation the- 
ory, by solving Schrodinger's equation exactly in parabolic coordinates. 
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6.5 Degenerate perturbation theory 

Let us now consider systems in which the eigenstates of the unperturbed Hamil- 
tonian, Ho, possess degenerate energy levels. It is always possible to represent 
degenerate energy eigenstates as the simultaneous eigenstates of the Hamilto- 
nian and some other Hermitian operator (or group of operators). Let us denote 
this operator (or group of operators) L. We can write 

Ho |n,l) = E n |n, I), (6.65) 

and 

L|n,l) = L nl |n,l), (6.66) 

where [Ho, L] = 0. Here, the E n and the L nl are real numbers which depend on 
the quantum numbers n, and n and I, respectively. It is always possible to find a 
sufficient number of operators which commute with the Hamiltonian in order to 
ensure that the L nl are all different. In other words, we can choose L such that 
the quantum numbers n and I uniquely specify each eigenstate. Suppose that for 
each value of n there are N n different values of I: i.e., the nth energy eigenstate 
is N n -fold degenerate. 

In general, L does not commute with the perturbing Hamiltonian, Hi. This 
implies that the modified energy eigenstates are not eigenstates of L. In this situ- 
ation, we expect the perturbation to split the degeneracy of the energy levels, so 
that each modified eigenstate |n, I}' acquires a unique energy eigenvalue E^. Let 
us naively attempt to use the standard perturbation theory of Sect. 6.3 to evalu- 
ate the modified energy eigenstates and energy levels. A direct generalization of 
Eqs. (6.32) and (6.33) yields 

Ki = Hn + e nlnl + Y. l 6n Tf +°( £3 )' ( 6 - 67 ) 

n',l'?£n,l n n ' 

and 

|n,l) , = |n,l)+ Y_ 6n ^ 1 |n',l') + 0(e 2 ), (6.68) 

where 

ena'niHTi'.l'lHiM). (6.69) 
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It is fairly obvious that the summations in Eqs. (6.67) and (6.68) are not well- 
behaved if the nth energy level is degenerate. The problem terms are those 
involving unperturbed eigenstates labeled by the same value of n, but different 
values of I: i.e., those states whose unperturbed energies are E n . These terms 
give rise to singular factors 1/(E n — E n ) in the summations. Note, however, that 
this problem would not exist if the matrix elements, envni, of the perturbing 
Hamiltonian between distinct, degenerate, unperturbed energy eigenstates cor- 
responding to the eigenvalue E n were zero. In other words, if 

(n,l'|Hi|n,l) = A nl 6 U ', (6.70) 

then all of the singular terms in Eqs. (6.67) and (6.68) would vanish. 

In general, Eq. (6.70) is not satisfied. Fortunately, we can always redefine the 
unperturbed energy eigenstates belonging to the eigenvalue E n in such a manner 
that Eq. (6.70) is satisfied. Let us define N n new states which are linear combina- 
tions of the l\! n original degenerate eigenstates corresponding to the eigenvalue 

Nn 

|n,l (1) ) = ^(u,k|u,l m )|u,k). (6.71) 

k=1 

Note that these new states are also degenerate energy eigenstates of the unper- 
turbed Hamiltonian corresponding to the eigenvalue E n . The |n, are chosen 
in such a manner that they are eigenstates of the perturbing Hamiltonian, Hi. 
Thus, 

Hi |n,l m ) = A nl |n,l (1) ). (6.72) 

The |n,l (1) ) are also chosen so that they are orthonormal, and have unit lengths. 
It follows that 

(u,l ,(1) |H 1 |n,l (1) ) =A ul 6 lF . (6.73) 

Thus, if we use the new eigenstates, instead of the old ones, then we can employ 
Eqs. (6.67) and (6.68) directly, since all of the singular terms vanish. The only 
remaining difficulty is to determine the new eigenstates in terms of the original 
ones. 
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Now 

Nn 

2jn,l)(n,l| = 1, (6.74) 

where 1 denotes the identity operator in the sub-space of all unperturbed energy 
eigenkets corresponding to the eigenvalue E n . Using this completeness relation, 
the operator eigenvalue equation (6.72) can be transformed into a straightfor- 
ward matrix eigenvalue equation: 

Y_ (n, l/|Hi |n, I") (n, l"|n, = Ki (n, l'|n, (6.75) 
i"=i 

This can be written more transparently as 

Ux = Ax, (6.76) 

where the elements of the N n x N n Hermitian matrix U are 

U jk = (nJlHiKk). (6.77) 

Provided that the determinant of U is non-zero, Eq. (6.76) can always be solved 
to give N n eigenvalues A nl (for I = 1 to N n ), with l\l n corresponding eigenvectors 
x nl . The eigenvectors specify the weights of the new eigenstates in terms of the 
original eigenstates: i.e., 

(x nl ) k =(n,k|n,l (1) ), (6.78) 
for k = 1 to N n . In our new scheme, Eqs. (6.67) and (6.68) yield 

E; i = E n + A nl + Y J^lt + o(e 3 ), (6.79) 

and 

|n,l (1 V = |n,l (1) )+ V -^^InMO + Ofe 2 ). (6.80) 

There are no singular terms in these expressions, since the summations are over 
n' 7^ n: i.e., they specifically exclude the problematic, degenerate, unperturbed 
energy eigenstates corresponding to the eigenvalue E n . Note that the first-order 
energy shifts are equivalent to the eigenvalues of the matrix equation (6.76). 

131 



6 APPROXIMATION METHODS 



6.6 The linear Stark effect 
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Let us examine the effect of an electric field on the excited energy levels of a 
hydrogen atom. For instance, consider the n = 2 states. There is a single I = 
state, usually referred to as 2s, and three I = 1 states (with m = —1,0,1), 
usually referred to as 2p. All of these states possess the same energy, E200 = 
— e 2 /(327teo ao). As in Sect. 6.4, the perturbing Hamiltonian is 



Hi 



e |E| z. 



(6.81) 



In order to apply perturbation theory, we have to solve the matrix eigenvalue 
equation 

Ux = Ax, (6.82) 

where U is the array of the matrix elements of Hi between the degenerate 2s and 
2p states. Thus, 

/ (2,0,0|z|2,1,0) 0\ 

(2, l,0|z|2, 0,0) 







U = e|E| 



V 



(6.83) 



where the rows and columns correspond to the |2, 0,0), |2,1,0), |2,1,1), and 
|2,1,— 1) states, respectively. Here, we have made use of the selection rules, 
which tell us that the matrix element of z between two hydrogen atom states is 
zero unless the states possess the same m quantum number, and I quantum num- 
bers which differ by unity. It is easily demonstrated, from the exact forms of the 
2s and 2p wave-functions, that 



(2, 0, 0|z|2, 1 , 0) = (2, 1 , 0|z|2, 0, 0) = 3 a . 



(6.84) 



It can be seen, by inspection, that the eigenvalues of U are Ai = 3eao|E|, 
A2 = — 3 e ao |E|, A3 = 0, and A4 = 0. The corresponding eigenvectors are 



Xl 



1/V2 





(6.85) 
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X 2 



x 3 



x 4 



I VV2 \ 

^/V2 
o 
o 

(o\ 



1 



(o\ 




V 1 / 



(6.86) 



(6.87) 



(6.88) 



It follows from Sect. 6.5 that the simultaneous eigenstates of the unperturbed 
Hamiltonian and the perturbing Hamiltonian take the form 

12,0,0) + |2, 1,0) 



|1 

12) 

13) 
14) 



V2 

12,0,0) -|2, 1,0) 



y/2 



12,1,1 



12,1, -V 



(6.89) 

(6.90) 

(6.91) 
(6.92) 



In the absence of an electric field, all of these states possess the same energy, E2oo- 
The first-order energy shifts induced by an electric field are given by 



AE 2 
AE 3 
AE 4 



+3 e d |E|, 
—3 e cio |E|, 

0, 
0. 



(6.93) 
(6.94) 
(6.95) 
(6.96) 



Thus, the energies of states 1 and 2 are shifted upwards and downwards, respec- 
tively, by an amount 3 e do |E| in the presence of an electric field. States 1 and 
2 are orthogonal linear combinations of the original 2s and 2p(m = 0) states. 
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Note that the energy shifts are linear in the electric field-strength, so this is a 
much larger effect that the quadratic effect described in Sect. 6.4. The energies 
of states 3 and 4 (which are equivalent to the original 2p (m = 1 ) and 2p (m = — 1 ) 
states, respectively) are not affected to first-order. Of course, to second-order the 
energies of these states are shifted by an amount which depends on the square of 
the electric field-strength. 

Note that the linear Stark effect depends crucially on the degeneracy of the 2s 
and 2p states. This degeneracy is a special property of a pure Coulomb potential, 
and, therefore, only applies to a hydrogen atom. Thus, alkali metal atoms do not 
exhibit the linear Stark effect. 



6.7 Fine structure 

Let us now consider the energy levels of hydrogen-like atoms (i.e., alkali metal 
atoms) in more detail. The outermost electron moves in a spherically symmetric 
potential V(r) due to the nuclear charge and the charges of the other electrons 
(which occupy spherically symmetric closed shells). The shielding effect of the 
inner electrons causes V(r) to depart from the pure Coulomb form. This splits the 
degeneracy of states characterized by the same value of n, but different values of 
I. In fact, higher I states have higher energies. 

Let us examine a phenomenon known as fine structure, which is due to inter- 
action between the spin and orbital angular momenta of the outermost electron. 
This electron experiences an electric field 

VV 

E = . (6.97) 

e 

However, a charge moving in an electric field also experiences an effective mag- 
netic field 

B = -v x E. (6.98) 
Now, an electron possesses a spin magnetic moment [see Eq. (5.170)] 

eS 

\x = . (6.99) 
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We, therefore, expect a spin-orbit contribution to the Hamiltonian of the form 

H LS = -H-B 

eS /1rdV\ 
v x 



m e Ver dr 
1 dV 



m e 2 r dr 



L S, (6.100) 



where L = m e r x v is the orbital angular momentum. When the above expression 
is compared to the observed spin-orbit interaction, it is found to be too large by 
a factor of two. There is a classical explanation for this, due to spin precession, 
which we need not go into. The correct quantum mechanical explanation requires 
a relativistically covariant treatment of electron dynamics (this is achieved using 
the so-called Dime equation). 

Let us now apply perturbation theory to a hydrogen-like atom, using Hls as 
the perturbation (with Hls taking one half of the value given above), and 

P 2 

H = ^- + V(r) (6.101) 
2m e 

as the unperturbed Hamiltonian. We have two choices for the energy eigenstates 
of Ho. We can adopt the simultaneous eigenstates of Ho, L 2 , S 2 , L z and S z , or the 
simultaneous eigenstates of Ho,L 2 ,S 2 , J 2 , and J z , where J = L + S is the total 
angular momentum. Although the departure of V(r) from a pure 1 /r form splits 
the degeneracy of same n, different I, states, those states characterized by the 
same values of n and I, but different values of mi, are still degenerate. (Here, 
mi, m s , and m.j are the quantum numbers corresponding to L z , S z , and J z , respec- 
tively.) Moreover, with the addition of spin degrees of freedom, each state is 
doubly degenerate due to the two possible orientations of the electron spin (i.e., 
m s = ±1/2). Thus, we are still dealing with a highly degenerate system. We 
know, from Sect. 6.6, that the application of perturbation theory to a degenerate 
system is greatly simplified if the basis eigenstates of the unperturbed Hamilto- 
nian are also eigenstates of the perturbing Hamiltonian. Now, the perturbing 
Hamiltonian, Hls, is proportional to L S, where 

T 2 -L 2 -S 2 

L S = . (6.102) 
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It is fairly obvious that the first group of operators (Ho, L 2 , S 2 , L z and S z ) does not 
commute with Hls, whereas the second group (Ho, L 2 , S 2 , J 2 , and J z ) does. In 
fact, L S is just a combination of operators appearing in the second group. Thus, 
it is advantageous to work in terms of the eigenstates of the second group of 
operators, rather than those of the first group. 

We now need to find the simultaneous eigenstates of Ho, L 2 , S 2 , J 2 , and J z . This 
is equivalent to finding the eigenstates of the total angular momentum resulting 
from the addition of two angular momenta: ji = I, and )2 — s = 1/2. According 
to Eq. (5.276), the allowed values of the total angular momentum are j =1+1/2 
and j = 1—1/2. We can write 

|l + l/2,m) = cos ct|m- 1/2,1/2) + since |m+ 1/2,-1/2), (6.103) 
|l-l/2,m) = -sina|m- 1/2,1/2) + cosa|m+ 1/2,-1/2). (6.104) 

Here, the kets on the left-hand side are |j,m.j) kets, whereas those on the right- 
hand side are |mi,m s ) kets (the jij2 labels have been dropped, for the sake of 
clarity). We have made use of the fact that the Clebsch-Gordon coefficients are 
automatically zero unless mj =mi + m s . We have also made use of the fact that 
both the |j,m.j) and |mi,m s ) kets are orthonormal, and have unit lengths. We 
now need to determine 



where the Clebsch-Gordon coefficient is written in (mi, m s |j, mj) form. 

Let us now employ the recursion relation for Clebsch-Gordon coefficients, 
Eq. (5.282), with)! = l,j 2 = 1/2, j = 1+ 1/2, mi = m - 1/2, m 2 = 1/2 (lower 
sign). We obtain 



VTL+ 1/2) (1 + 3/2) -m(m+1) (m - 1 /2, 1 /2|l + 1 /2, m) 
= v/l(l + l)-(m-1/2) (m+1/2) (m+ 1/2, 1/2|l+ 1/2, m+ 1), (6.106) 



cos oc = (m — 1 /2, 1 /2|l + 1 /2, m) 



(6.105) 



which reduces to 



(m-1/2,1/2|l+1/2,m) 



l + m+ 1/2 
^l + m + 3/2 



(m+1/2,1/2|l+1/2,m+1). (6.107) 
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We can use this formula to successively increase the value of m\. For instance, 



l + m+ 1/2 



I + m + 3/2 



(m-1/2,1/2|l + 1/2,m) 



This procedure can be continued until vn\ attains its maximum possible value, I. 
Thus, 



^l + m + 3/2^l + m + 5/2 

x(m + 3/2,l/2|l + l/2,m + 2). (6.108) 



m-1/2,1/2|l + 1/2,m 



l + m+1/2 
N 21 + 1 



(I, l/2|l+l/2,l+ 1/2). (6.109) 



Consider the situation in which mi and m both take their maximum values, 
I and 1 /2, respectively. The corresponding value of m.j is I + 1 /2. This value is 
possible when j =1+1/2, but not when j = 1—1/2. Thus, the |mi, m s ) ket |l, 1 /2) 
must be equal to the |j, rrij) ket |l + 1/2, 1 + 1 /2), up to an arbitrary phase-factor. 
By convention, this factor is taken to be unity, giving 



(l,l/2|l + l/2,l+ 1/2) = 1 
It follows from Eq. (6.109) that 



(6.110) 



coscx= (m- 1/2, l/2|l+ 1/2, m) 



l + m+1/2 
N 21+1 ' 



Now, 



.2 , l + m+1/2 l-m+1/2 
sin oc = I — 



21+1 



21 + 1 



(6.111) 



(6.112) 



We now need to determine the sign of sin oc. A careful examination of the 
recursion relation, Eq. (5.282), shows that the plus sign is appropriate. Thus, 



|l+1/2,m) 



l + m+1/2 
\ 21+1 



jm- 1/2, 1/2) 



+< 



l-m+1/2 



^^^lm+1/2,-1/2), 



(6.113) 
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|l- 1/2, m) 



l-m+1/2 
"N 21+1 



+ 



\ 



l + m + 1/2 



m- 1/2,1/2) 



m+ 1/2,-1/2). 



(6.114) 



21+1 

It is convenient to define so called spin-angular functions using the Pauli two- 
component formalism: 



j=l±1/2,m 



l + m+1/2 m _ 1/2 

N 2i + i (e ' (p)x+ 



l + m+1/2 m+1/2 



\ 21+1 



1 



x/UTT 



i^/l ± m + 1/2 Y L m " 1/2 (e, <p) 
l + m+1/2 Yr +1/2 (9,(p) 



(6.115) 



These functions are eigenfunctions of the total angular momentum for spin one- 
half particles, just as the spherical harmonics are eigenfunctions of the orbital 
angular momentum. A general wave-function for an energy eigenstate in a 
hydrogen-like atom is written 

lpnlm± = Rnl(r)^ =l±1/2 ' m . (6.116) 

The radial part of the wave-function, R n i(r), depends on the radial quantum num- 
ber n and the angular quantum number I. The wave-function is also labeled by 
m, which is the quantum number associated with J z . For a given choice of I, the 
quantum number j (i.e., the quantum number associated with J 2 ) can take the 
values I ± 1/2. 

The |l ± 1 /2,m) kets are eigenstates of L S, according to Eq. (6.102). Thus, 



L- S | j = I ± 1 /2, m.j = m) = - 
giving 

L-S|l + 1/2, m 
L-S |l — 1/2, m 



h 1 



[j(j + 1)-l(l + 1)-3/4] |j,m) 



lh 2 



|l + 1/2,m), 



(1+1) h 2 



|l-1/2,m 



(6.117) 

(6.118) 
(6.119) 
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It follows that 

( y+1/2,m)t L . S y+1/2,m m 
( y-1/2,m)t L . S y-1/2,m dQ 

where the integrals are over all solid angle. 



IFi 2 

~2~' 

(1 + 1) Fi 2 



(6.120) 
(6.121) 



Let us now apply degenerate perturbation theory to evaluate the shift in energy 
of a state whose wave-function is ip n im± due to the spin-orbit Hamiltonian Hls- 
To first-order, the energy-shift is given by 



AE 



nlm± 



(^nlmi) 1 " H LS lklm± dV, 



(6.122) 



where the integral is over all space. Equations (6.100) (remember the factor of 
two), (6.116), and (6.120)-(6.121) yield 



AE 



1 /1dV\lFi 2 



nlm- 



AE 



2 m e 2 \ r dr / 2 
1 /I dV\ (1+1) Fi 2 



nlm- 



where 



1 dV* 



r dr 



2m e 2 \r dr 



(Rm)*-^Rnir 2 dr. 
r dr 



(6.123) 
(6.124) 

(6.125) 



Equations (6.123)-(6.124) are known as Lande's interval rule. 

Let us now apply the above result to the case of a sodium atom. In chemist's 
notation, the ground state is written 



(1s) 2 (2s) 2 (2p) 6 (3s). 



(6.126) 



The inner ten electrons effectively form a spherically symmetric electron cloud. 
We are interested in the excitation of the eleventh electron from 3s to some 
higher energy state. The closest (in energy) unoccupied state is 3p. This state 
has a higher energy than 3s due to the deviations of the potential from the pure 
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Coulomb form. In the absence of spin-orbit interaction, there are six degenerate 
3p states. The spin-orbit interaction breaks the degeneracy of these states. The 
modified states are labeled (3p)i/2 and (3p)3/2, where the subscript refers to the 
value of j. The four (3p)3/2 states lie at a slightly higher energy level than the two 
(3p)i/2 states, because the radial integral (6.125) is positive. The splitting of the 
(3p) energy levels of the sodium atom can be observed using a spectroscope. The 
well-known sodium D line is associated with transitions between the 3p and 3s 
states. The fact that there are two slightly different 3p energy levels (note that 
spin-orbit coupling does not split the 3s energy levels) means that the sodium D 
line actually consists of two very closely spaced spectroscopic lines. It is easily 
demonstrated that the ratio of the typical spacing of Balmer lines to the splitting 
brought about by spin-orbit interaction is about 1 : oc 2 , where 

e 2 1 

oc = —— = — (6.127) 
2 eo He 137 

is the fine structure constant. Note that Eqs. (6.123)-(6.124) are not entirely 
correct, since we have neglected an effect (namely, the relativistic mass correction 
of the electron) which is the same order of magnitude as spin-orbit coupling. 



6.8 The Zeeman effect 

Consider a hydrogen-like atom placed in a uniform z-directed magnetic field. The 
change in energy of the outermost electron is 

H B = -n-B, (6.128) 

where 

H = --^(L + 2S) (6.129) 

2m e 

is its magnetic moment, including both the spin and orbital contributions. Thus, 

H B = ^(L Z + 2S Z ). (6.130) 
2m e 

Suppose that the energy-shifts induced by the magnetic field are much smaller 
than those induced by spin-orbit interaction. In this situation, we can treat Hb as 
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a small perturbation acting on the eigenstates of Ho + Hls- Of course, these states 
are the simultaneous eigenstates of J 2 and J z . Let us consider one of these states, 
labeled by the quantum numbers j and m, where j = I ± 1 /2. From standard 
perturbation theory, the first-order energy-shift in the presence of a magnetic 
field is 

AE nlm± = (l± 1/2,m|H B |l± 1/2,m). (6.131) 

Since 

L Z + 2S Z = J z + S z , (6.132) 

we find that 

eB 



AE 



nlm± 



2m e 

Now, from Eqs. (6.113)-(6.114) 

|l±1/2,m) = =b 



(mTi+ (l± 1/2,m|S z |l± 1/2, m> ) 



(6.133) 



l±m+1/2 
\ 21+1 



+< 



l Tm + 1/2 
N 21+1 



m- 1/2, 1/2} 



m+ 1/2,-1/2). 



(6.134) 



It follows that 

(l±1/2,m|S z |l±1/2,m) = 



2(21+1) 
m h 



± 



21+1 



[(l±m + 1/2)-(l + m + 1/2)] 

(6.135) 



Thus, we obtain Lande's formula for the energy-shift induced by a weak magnetic 
field: 



ehB 

AE nlm± = m 

2m P 



1 ± 



1 



21+1 



(6.136) 



Let us apply this theory to the sodium atom. We have already seen that the 
non-Coulomb potential splits the degeneracy of the 3s and 3p states, the latter 
states acquiring a higher energy. The spin-orbit interaction splits the six 3p states 
into two groups, with four j = 3/2 states lying at a slightly higher energy than 
two j = 1/2 states. According to Eq. (6.136), a magnetic field splits the (3p) 3 / 2 
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quadruplet of states, each state acquiring a different energy. In fact, the energy of 
each state becomes dependent on the quantum number m, which measures the 
projection of the total angular momentum along the z-axis. States with higher 
m values have higher energies. A magnetic field also splits the (3p)i/2 doublet 
of states. However, it is evident from Eq. (6.136) that these states are split by a 
lesser amount than the j = 3/2 states. 

Suppose that we increase the strength of the magnetic field, so that the energy- 
shift due to the magnetic field becomes comparable to the energy-shift induced 
by spin-orbit interaction. Clearly, in this situation, it does not make much sense to 
think of Hb as a small interaction term operating on the eigenstates of Ho + Hls- 
In fact, this intermediate case is very difficult to analyze. Let us consider the 
extreme limit in which the energy-shift due to the magnetic field greatly exceeds 
that induced by spin-orbit effects. This is called the Paschen-Back limit. 

In the Paschen-Back limit we can think of the spin-orbit Hamiltonian, Hls, as 
a small interaction term operating on the eigenstates of Ho + Hb- Note that the 
magnetic Hamiltonian, Hb, commutes with L 2 ,S 2 ,L Z ,S Z , but does not commute 
with L 2 , S 2 , J 2 , J z . Thus, in an intense magnetic field, the energy eigenstates of a 
hydrogen-like atom are approximate eigenstates of the spin and orbital angular 
momenta, but are not eigenstates of the total angular momentum. We can label 
each state by the quantum numbers n (the energy quantum number), I, mi, 
and m s . Thus, our energy eigenkets are written |u, I, mi, m s ). The unperturbed 
Hamiltonian, Ho, causes states with different values of the quantum numbers n 
and I to have different energies. However, states with the same value of n and I, 
but different values of mi and m s , are degenerate. The shift in energy due to the 
magnetic field is simply 

AE nLmims = (u,l,mi,m s |H B |n,l,mi,m s ) 

= ^ B (mi + 2m s ). (6.137) 
2m e 

Thus, states with different values of mi + 2m s acquire different energies. 

Let us apply this result to a sodium atom. In the absence of a magnetic field, 
the six 3p states form two groups of four and two states, depending on the values 
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of their total angular momentum. In the presence of an intense magnetic field 
the 3p states are split into five groups. There is a state with mi + 2 m s = 2, a state 
with mi + 2 m s = 1 , two states with mi + 2 m s = 0, a state with mi + 2 m s = — 1 , 
and a state with m L + 2m s = —2. These groups are equally spaced in energy, the 
energy difference between adjacent groups being e PiB/2m e . 

The energy-shift induced by the spin-orbit Hamiltonian is given by 

AE nLmims = (n,l,mi,m s |H LS |n,l,mi,m s ), (6.138) 



where 



Now, 



H LS = — ^-^L-S. (6.139) 
2m/T dr 



(L-S) = (L z S z +(L+S- + L-S + )/2) 

= Pi 2 mim s , (6.140) 



since 



(L ± ) = (S*) = (6.141) 

for expectation values taken between the simultaneous eigenkets of L z and S z . 
Thus, 

ap fi 2 mim s /1dV\ 

AE nlmi1Us = — y~ (--=-). (6.142) 

2 m/ \ r dr / 

Let us apply the above result to a sodium atom. In the presence of an intense 
magnetic field, the 3p states are split into five groups with (mi,m s ) quantum 
numbers (1,1/2), (0,1/2), (1,-1/2), or (-1,1/2), (0,-1/2), and (-1,-1/2), re- 
spectively, in order of decreasing energy. The spin-orbit term increases the en- 
ergy of the highest energy state, does not affect the next highest energy state, 
decreases, but does not split, the energy of the doublet, does not affect the next 
lowest energy state, and increases the energy of the lowest energy state. The net 
result is that the five groups of states are no longer equally spaced in energy. 

The sort of magnetic field-strength needed to get into the Paschen-Bach limit 
is given by 

B PB - ex 2 - e ~ 25 tesla. (6.143) 
e Hq 
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Obviuously this is an extremely large field-strength. 
6.9 Time-dependent perturbation theory 

Suppose that the Hamiltonian of the system under consideration can be written 

H = H + H 1 (t), (6.144) 

where H does not contain time explicitly and Hi is a small time-dependent 
perturbation. It is assumed that we are able to calculate the eigenkets of the 
unperturbed Hamiltonian: 

H |n) =EJu). (6.145) 

We know that if the system is in one of the eigenstates of Ho then, in the ab- 
sence of the external perturbation, it remains in this state for ever. However, 
the presence of a small time-dependent perturbation can, in principle, give rise 
to a finite probability that a system initially in some eigenstate |i) of the unper- 
turbed Hamiltonian is found in some other eigenstate at a subsequent time (since 
|i) is no longer an exact eigenstate of the total Hamiltonian), In other words, a 
time-dependent perturbation causes the system to make transitions between its 
unperturbed energy eigenstates. Let us investigate this effect. 

Suppose that at t = t the state of the system is represented by 

|A> =^c n |u), (6.146) 

n 

where the c n are complex numbers. Thus, the initial state is some linear su- 
perposition of the unperturbed energy eigenstates. In the absence of the time- 
dependent perturbation, the time evolution of the system is given by 

|A,to,t> =^Cnexp([-iE n (t-to)/Ti]|n>. (6.147) 

TL 

Now, the probability of finding the system in state |n) at time t is 

P n (t) = |c n exp[-iE n (t- t )/h]| 2 = |c n | 2 = P n (t ). (6.148) 
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Clearly, with Hi = 0, the probability of finding the system in state |u) at time t is 
exactly the same as the probability of finding the system in this state at the initial 
time to. However, with Hi ^ 0, we expect P n (t) to vary with time. Thus, we can 
write 

|A,t ,t> =^c u (t)exp[-iE u (t-t )/M]|n), (6.149) 

n 

where P n (t) = |c n (t)| 2 . Here, we have carefully separated the fast phase oscilla- 
tion of the eigenkets, which depends on the unperturbed Hamiltonian, from the 
slow variation of the amplitudes c n (t), which depends entirely on the perturba- 
tion (i.e., c n is constant if Hi = 0). Note that in Eq. (6.149) the eigenkets |n) are 
time-independent (they are actually the eigenkets of Ho evaluated at the time to). 

Schrodinger's time evolution equation yields 

g 

iFi— |A,t ,t> =H|A,t ,t) = (H + Hi)|A,t ,t). (6.150) 
ot 

It follows from Eq. (6.149) that 

(H + Hi )|A, t 0y t) = Y_ Cm(t) exp[-i E m (t - t )/Ti] (E m + Hi ) |m). (6.151) 

We also have 



l 



9 ( dc \ 

Fl— |A,to,t> = Y_ + c ™ (t) E ™J exp[-iE m (t- t )/h] |m>, (6.152) 



where use has been made of the time-independence of the kets |m). According 
to Eq. (6.150), we can equate the right-hand sides of the previous two equations 
to obtain 

dc 

Y iTi-r^exp[-iE m (t- t )/Ti]lm) = Y c m (t) exp[-iE m (t - t )/h] Hi |m). 

m m 

(6.153) 

Left-multiplication by (n| yields 

dc 

ih—^ = Y H nm (t) exp[io) nm (t-t )] c m (t), (6.154) 

dt m 

where 

Hnm(t) = (n|H 1 (t)|m>, (6.155) 
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and 



Wnm = r • (6.156) 

n. 

Here, we have made use of the standard orthonormality result, (n|m) = 6 nm . 
Suppose that there are N linearly independent eigenkets of the unperturbed 
Hamiltonian. According to Eq. (6.154), the time variation of the coefficients 
c n , which specify the probability of finding the system in state |n) at time t, is de- 
termined by N coupled first-order differential equations. Note that Eq. (6.154) is 
exact — we have made no approximations at this stage. Unfortunately, we cannot 
generally find exact solutions to this equation, so we have to obtain approximate 
solutions via suitable expansions in small quantities. However, for the particu- 
larly simple case of a two-state system (i.e., N = 2), it is actually possible to 
solve Eq. (6.154) without approximation. This solution is of enormous practical 
importance. 



6.10 The two-state system 

Consider a system in which the time-independent Hamiltonian possesses two 
eigenstates, denoted 

H |l) = EtII), (6.157) 
H 12) = E 2 |2>. (6.158) 

Suppose, for the sake of simplicity, that the diagonal matrix elements of the in- 
teraction Hamiltonian, Hi, are zero: 

(IIHtII) = (2|Hi|2) = 0. (6.159) 

The off-diagonal matrix elements are assumed to oscillate sinusoidally at some 
frequency cu: 

(1|Ht|2) = (2|H]|1)* =yexp(icut), (6.160) 

where y and cu are real. Note that it is only the off-diagonal matrix elements 
which give rise to the effect which we are interested in — namely, transitions be- 
tween states 1 and 2. 
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For a two-state system, Eq. (6.154) reduces to 

dci 

i Tx — — = y exp[+i (cu — o)2i) t] C2, (6.161) 
at 

dc2 

i Tx -— — = y exp[— i (cu — cu 2 i) t] ci , (6.162) 
dt 

where a>2i = (E2 — Ei )/Tx, and assuming that to = 0. Equations (6.161) and 
(6.162) can be combined to give a second-order differential equation for the 
time variation of the amplitude C2: 

d 2 C2 . , , dc2 y 2 ^ , 

— f + 1 (cu - CU21 + h c 2 = 0. (6.163) 
dt z dt h 

Once we have solved for C2, we can use Eq. (6.162) to obtain the amplitude ci. 
Let us look for a solution in which the system is certain to be in state 1 at time 
t = 0. Thus, our boundary conditions are ci(0) = 1 and cz[0) = 0. It is easily 
demonstrated that the appropriate solutions are 



y 2 /h / +(cu-cu 21 ) 2 /4 



C2ltJ = / , „ 2 . , - , f exp[-i (cu - CU21J t/2] 



x sin^y 2 /h 2 + (cu-cu 21 ) 2 /4 t) , (6.164) 

d(t) = exp[i(cu-cu 21 )t/2] cos^y 2 /h 2 + (cu - cu 21 ) 2 /4 
i(cu-cu 21 )/2 

/ llrl r ^7/, exp[l(cU-CU 21 )t/2] 

Vy 2 A 2 + (cu-cu 21 ) 2 /4 
x sin^y 2 /h 2 + (cu-cu 21 ) 2 /4 . (6.165) 

Now, the probability of finding the system in state 1 at time t is simply Pi (t) = 
|ci| 2 . Likewise, the probability of finding the system in state 2 at time t is P2(t) = 
IC2I 2 . It follows that 



y 2 /Fi 2 + (cu-cu 2 i) 2 /4 
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X sin 




(6.166) 



Pi(t) 



1-P 2 (t). 



(6.167) 



This result is known as Rabi's formula. 

Equation (6.166) exhibits all the features of a classic resonance. At resonance, 
when the oscillation frequency of the perturbation, cv, matches the frequency 
CU21 , we find that 



According to the above result, the system starts off at t = in state 1 . After a 
time interval nh/2y it is certain to be in state 2. After a further time interval 
7i Pi/2y it is certain to be in state 1, and so on. Thus, the system periodically 
flip-flops between states 1 and 2 under the influence of the time-dependent per- 
turbation. This implies that the system alternatively absorbs and emits energy 
from the source of the perturbation. 

The absorption-emission cycle also take place away from the resonance, when 
cv ^ o>2i . However, the amplitude of oscillation of the coefficient C2 is reduced. 
This means that the maximum value of P 2 (t) is no longer unity, nor is the mini- 
mum value of Pi (t) zero. In fact, if we plot the maximum value of P2(t) as a func- 
tion of the applied frequency, cv, we obtain a resonance curve whose maximum 
(unity) lies at the resonance, and whose full-width half-maximum (in frequency) 
is 4y/Ti. Thus, if the applied frequency differs from the resonant frequency by 
substantially more than 2y/h then the probability of the system jumping from 
state 1 to state 2 is very small. In other words, the time-dependent perturbation 
is only effective at causing transitions between states 1 and 2 if its frequency of 
oscillation lies in the approximate range a>2i ± 2y/Ti. Clearly, the weaker the 
perturbation (i.e., the smaller y becomes), the narrower the resonance. 



Pi(t) 
P 2 (t) 



cos 2 (y t/Ti), 
sin 2 (y t/Ti). 



(6.168) 



(6.169) 
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6.11 Spin magnetic resonance 



Consider a spin one-half system (e.g., a bound electron) placed in a uniform z- 
directed magnetic field, and then subjected to a small time-dependent magnetic 
field rotating in the x-y plane. Thus, 

B = B 2 + B 1 (coscut£ + sincut£), (6.170) 

where B and Bi are constants, with Bi <C B . The rotating magnetic field usu- 
ally represents the magnetic component of an electromagnetic wave propagating 
along the z-axis. In this system, the electric component of the wave has no effect. 
The Hamiltonian is written 

H = -n.B = Ho + H b (6.171) 

where 

H = — S z , (6.172) 
m e 

and 

e Bi 

Hi = (cos cutS x + sin cut St,) . (6.173) 

m e 

The eigenstates of the unperturbed Hamiltonian are the 'spin up' and 'spin 
down' states, denoted |+) and |— }, respectively. Thus, 

H |±>=±i^2|±>. (6.174) 
2m e 

The time-dependent Hamiltonian can be written 

Hi = [exp( i cut) S + exp(-i cut) S+l , (6.175) 

2m e L 

where S + and S are the conventional raising and lowering operators for the spin 
angular momentum. It follows that 

(+|H 1 |+> = (-|Hi|->=0, (6.176) 

and 

(-1^1+) = (+|H!|->* = ^^exp(icut). (6.177) 

2m e 
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It can be seen that this system is exactly the same as the two-state system 
discussed in the previous section, provided that we make the identifications 

11) -> I-}, (6.178) 

12) -> |+>, (6.179) 

cu 2 i -> — , (6.180) 

m e 

y (6.181) 

2m e 

The resonant frequency, a>2i, is simply the spin precession frequency for an elec- 
tron in a uniform magnetic field of strength Bo. In the absence of the perturba- 
tion, the expectation values of S x and S y oscillate because of the spin precession, 
but the expectation value of S z remains invariant. If we now apply a magnetic 
perturbation rotating at the resonant frequency then, according to the analysis of 
the previous section, the system undergoes a succession of spin-flops, |+) ^ |— ), 
in addition to the spin precession. We also know that if the oscillation frequency 
of the applied field is very different from the resonant frequency then there is 
virtually zero probability of the field triggering a spin-flop. The width of the 
resonance (in frequency) is determined by the strength of the oscillating mag- 
netic perturbation. Experimentalist are able to measure the magnetic moments 
of electrons, and other spin one-half particles, to a high degree of accuracy by 
placing the particles in a magnetic field, and subjecting them to an oscillating 
magnetic field whose frequency is gradually scanned. By determining the reso- 
nant frequency (i.e., the frequency at which the particles absorb energy from the 
oscillating field), it is possible to calculate the magnetic moment. 



6.12 The Dyson series 

Let us now try to find approximate solutions of Eq. (6.154) for a general system. 
It is convenient to work in terms of the time evolution operator, U(to, t), which is 
defined 

|A,to,t) =U(to,t)|A>. (6.182) 
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Here, |A,to,t) is the state ket of the system at time t, given that the state ket at 
the initial time to is |A). It is easily seen that the time evolution operator satisfies 
the differential equation 

i Fi 9U(to>t) = (Ho + Hi ) U(to, t) , (6.183) 
ot 

subject to the boundary condition 

U(t ,t ) = l. (6.184) 



In the absence of the external perturbation, the time evolution operator re- 
duces to 

U(t ,t) = exp[-iH (t-t )/Ti]. (6.185) 
Let us switch on the perturbation and look for a solution of the form 

U(to,t) =exp[-iH (t-to)/FL]Ui(to,t). (6.186) 
It is readily demonstrated that Ui satisfies the differential equation 

i h aUl ^° ,t] = Hi(to, t) Uj(to, t), (6.187) 

where 

Hi(to,t) =exp[+iH (t-to)/rt]H 1 exp[-iH (t - t )/rt], (6.188) 
subject to the boundary condition 

Ui(to,to) = 1. (6.189) 

Note that Ui specifies that component of the time evolution operator which is due 
to the time-dependent perturbation. Thus, we would expect Ui to contain all of 
the information regarding transitions between different eigenstates of Ho caused 
by the perturbation. 

Suppose that the system starts off at time to in the eigenstate |i) of the un- 
perturbed Hamiltonian. The subsequent evolution of the state ket is given by 
Eq. (6.149), 

|i,t 0) t> =^c m (t)exp[-iE m (t-t )/fi]|m). (6.190) 

m 
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However, we also have 

|i, t , t) = exp[-i H (t - t )/Ti] Ui(t , t) |i) 



(6.191) 



It follows that 

c n (t) = (n|Ui(to,t)|i>, (6.192) 

where use has been made of (n|m) = 6 nm . Thus, the probability that the system 
is found in state |u) at time t, given that it is definitely in state |i) at time to, is 
simply 

Pwu(t ,t) = |(n|Ui(t ,t)|i)| 2 . (6.193) 

This quantity is usually termed the transition probability between states |i) and 
|n). 

Note that the differential equation (6.187), plus the boundary condition (6.189), 
are equivalent to the following integral equation, 



Ui(to,t) = 1 - 



Pi 



H I (t ,t , )Ui(to,t , )dt / . 



(6.194) 



to 



We can obtain an approximate solution to this equation by iteration: 



Ui(to,t) ~ 1 



Ti 
i 

h 



rt 

to 
rt 

to 



Hi(t ,t') 



Hi(t ,t') dt' 



Pi 



H I (t ,t ,, )U I (to,t ,/ ) 



to 



dt' 



+ 





rt f 


(¥)'j 


dt' 




to 



t' 



Hitto.tOHifto.t") dt" + 



(6.195) 



to 



This expansion is known as the Dyson series. Let 



-n — 



r (0) i r (D i M) i 



(6.196) 



where the superscript (1) refers to a first-order term in the expansion, etc. It 
follows from Eqs. (6.192) and (6.195) that 



cf(t) = 6 lu , 



(6.197) 
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1 

h 



t) 



n|Hi(t ,t')|i) dt', 

to 

2 rt 



dt' 



to 



rt' 



to 



n|H I (to,t , )H I (to,t ,, )|i) dt". 



These expressions simplify to 

Cfft) = Sin, 



c'"(t) = - 



c« 2 »(t) = 



1 



rt 



exp[icu ni (t / -t )]H ni (t / )dt / , 



to 
2 

m J 



rt 



dt' 



to 



t' 



dt" exp[icu nm (t'-to)] 



to 



where 



and 



xH nm (t') exp[icu mi (t" - 1 )] H mi (t"), 



h 



Hnm(t) = (n|H!(t)|m>. 
The transition probability between states i and n is simply 

P^n(t ,t) = |c^+C(nc« + ...| 2 . 



(6.198) 
(6.199) 



(6.200) 
(6.201) 



(6.202) 

(6.203) 
(6.204) 

(6.205) 



According to the above analysis, there is no chance of a transition between 
states |i) and |n) (i ^ n) to zeroth-order (i.e., in the absence of the perturbation). 
To first-order, the transition probability is proportional to the time integral of the 
matrix element (n|H-||i), weighted by some oscillatory phase-factor. Thus, if the 
matrix element is zero, then there is no chance of a first-order transition between 
states |i) and |n). However, to second-order, a transition between states |i) and 
|n) is possible even when the matrix element (n|Hi |i) is zero. 
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6.13 Constant perturbations 



Consider a constant perturbation which is suddenly switched on at time t = 0: 

H](t) = fort<0 

H](t) = H] fort> 0, (6.206) 

where Hi is time-independent, but is generally a function of the position, mo- 
mentum, and spin operators. Suppose that the system is definitely in state |i) at 
time t = 0. According to Eqs. (6.200)-(6.202) (with t = 0), 



,(0) 
'n 



(t) 
'(t) 



i 

h 
H 



(6.207) 



H 



TLX 



m 



giving 



Pt-wv(t) ^ |C 



1)l2 



4IH 



exp[itu ni (t' — t)] dt' 
[1 -exp(ia) ni t)], 

(En-Ei)t' 



-n 



Etl 2 



sin 



2Ti 



(6.208) 



(6.209) 



for i^n. The transition probability between states |i) and |n) can be written 



(t) 



la 



i 2 t 2 



sine 



(E n ~ E t ) t 

2Ti 



where 



sinc(x) 



sinx 



(6.210) 



(6.211) 



The sine function is highly oscillatory, and decays like 1 /|x| at large |x|. It is a 
good approximation to say that sinc(x) is small except when |x| < n. It follows 
that the transition probability, Pi^ n , is small except when 

2nh 



Eil 



(6.212) 



Note that in the limit t — > oo only those transitions which conserve energy (i.e., 
E n = Ei) have an appreciable probability of occurrence. At finite t, is is possible 
to have transitions which do not exactly conserve energy, provided that 

AE At $ Pi, (6.213) 
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where AE = |E n — E-jJ is change in energy of the system associated with the 
transition, and At = t is the time elapsed since the perturbation was switched 
on. Clearly, this result is just a manifestation of the well-known uncertainty re- 
lation for energy and time. This uncertainty relation is fundamentally different 
to the position-momentum uncertainty relation, since in non-relativistic quan- 
tum mechanics position and momentum are operators, whereas time is merely a 
parameter. 

The probability of a transition which conserves energy (i.e., E n = E t ) is 

Pwn(t) = (6.214) 
n 

where use has been made of sinc(O) = 1 . Note that this probability grows quadrat- 
ically with time. This result is somewhat surprising, since it implies that the prob- 
ability of a transition occurring in a fixed time interval, t to t + dt, grows linearly 
with t, despite the fact that Hi is constant for t > 0. In practice, there is usually 
a group of final states, all possessing nearly the same energy as the energy of the 
initial state It is helpful to define the density of states, p(E), where the num- 
ber of final states lying in the energy range E to E + dE is given by p(E) dE. Thus, 
the probability of a transition from the initial state i to any of the continuum of 
possible final states is 



giving 

n. 



Pwu(t) p(E n ) dE n , (6.215) 



|H ni | 2 p(E n ) sinc 2 (x) dx, (6.216) 



where 

x= (E n -Ei)t/2 ft, (6.217) 

and use has been made of Eq. (6.210). We know that in the limit t — > oo the 
function sinc(x) is only non-zero in an infinitesimally narrow range of final ener- 
gies centred on E n = E t . It follows that, in this limit, we can take p(E n ) and |H T 12 
out of the integral in the above formula to obtain 



'ml 



In 



Pw[n](t) = — |H ni | 2 p(E n )t 



(6.218) 
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where Pi->[ n ] denotes the transition probability between the initial state |i) and 
all final states |n) which have approximately the same energy as the initial state. 
Here, |H ni | 2 is the average of |H ni | 2 over all final states with approximately the 
same energy as the initial state. In deriving the above formula, we have made 
use of the result 

TOO 

sinc 2 (x) dx = n. (6.219) 

Note that the transition probability, Pi_»[ n ], is now proportional to t, instead of t 2 . 

It is convenient to define the transition rate, which is simply the transition 
probability per unit time. Thus, 

w^ [u] = (6.220) 

giving 

2n 



Wi->[n] = ^-|H ni | 2 p(E n ) 



(6.221) 

This appealingly simple result is known as Fermi's golden rule. Note that the 
transition rate is constant in time (for t > 0): i.e., the probability of a transition 
occurring in the time interval t to t + dt is independent of t for fixed dt. Fermi's 
golden rule is sometimes written 

w^ n = ^ |H ni | 2 6(E n - E), (6.222) 
n, 

where it is understood that this formula must be integrated with J p(E n ) dE n to 
obtain the actual transition rate. 

Let us now calculate the second-order term in the Dyson series, using the 
constant perturbation (6.206). From Eq. (6.202) we find that 



2 



= (^) L H ^ H 

h ^7" E m — E| t 



rru 

m 

ft 



t 



dt' exp(icu nm t / ) 



pt' 



dt exp(icu mi t) 



i * Hum H mi 



[exp( i cu ni t' ) - exp( i cu nm t'] ) dt' 



it ^- H nm H^ t [ exp ( ia)n . t /2)si nc (a) ni t/2) 



h- m E m E x 
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exp(icu nm t/2)sinc(a) nm t/2)] 



(6.223) 



Thus, 



Cn(t) 



c^ IJ + c 



(2) 
n 



it 



exp(icu ni t/2) 



Pi 



Hn-m H 



nm 1 l mi 



m 



-m 



Hi 



sinc(cu ni t/2) 



H n ni H m i 



m 



-m 



exp( ia) im t/2) sinc(cu nm t/2) 



(6.224) 



where use has been made of Eq. (6.208). It follows, by analogy with the previous 
analysis, that 



n 



2n 

IT 



H-rmi Hmi 



m 



-m 



Hi 



p(E n ) 



(6.225) 



where the transition rate is calculated for all final states, |n), with approximately 
the same energy as the initial state, and for intermediate states, |m) whose 
energies differ from that of the initial state. The fact that E m ^ Ei causes the 
last term on the right-hand side of Eq. (6.224) to average to zero (due to the 
oscillatory phase-factor) during the evaluation of the transition probability. 

According to Eq. (6.225), a second-order transition takes place in two steps. 
First, the system makes a non-energy-conserving transition to some intermedi- 
ate state |m). Subsequently, the system makes another non-energy-conserving 
transition to the final state |u). The net transition, from |i) to |n), conserves en- 
ergy. The non-energy-conserving transitions are generally termed virtual transi- 
tions, whereas the energy conserving first-order transition is termed a real transi- 
tion. The above formula clearly breaks down if H nm H m i ^ when E m = E t . 
This problem can be avoided by gradually turning on the perturbation: i.e., 
Hi — > exp(ut) Hi (where r\ is very small). The net result is to change the en- 
ergy denominator in Eq. (6.225) from E| — E m to E t — E m + i hr|. 
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6.14 Harmonic perturbations 

Consider a perturbation which oscillates sinusoidally in time. This is usually 
called a harmonic perturbation. Thus, 

Hi (t) = V exp( i cut) + V 1 " exp(-i cut) , (6.226) 

where V is, in general, a function of position, momentum, and spin operators. 

Let us initiate the system in the eigenstate |i) of the unperturbed Hamiltonian, 
Ho, and switch on the harmonic perturbation at t = 0. It follows from Eq. (6.201) 
that 



c (1) 



— 1 



ft 



o 



V ni exp(i cut') + Vja exp(-i cut')] exp( i cu ni t') dt', (6.227) 



where 



^ ( 1 -exp[i(cu ni + cu)t] ^ 1 -exp[i(cu ni - cu)t] t 
Pi \ cu ni + cu cu ni — cu ' 



V ni = (n|V|i), (6.228) 
yt. = (nlV^i) = (i|V|n)*. (6.229) 

This formula is analogous to Eq. (6.208), provided that 

CU n i = ^ U - ^ X — > CU n i ± CU. (6.230) 

ri 

Thus, it follows from the previous analysis that the transition probability P i _ >n (t) = 
|c^| 2 is only appreciable in the limit t — > oo if 

cu ni + cu~0 or E n ~Et— rtcu, (6.231) 
cu ni — cu ~ or E n ~ E t + Ftcu. (6.232) 

Clearly, (6.231) corresponds to the first term on the right-hand side of Eq. (6.227), 
and (6.232) corresponds to the second term. The former term describes a process 
by which the system gives up energy hcu to the perturbing field, whilst making a 
transition to a final state whose energy level is less than that of the initial state 
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by Picu. This process is known as stimulated emission. The latter term describes 
a process by which the system gains energy rtcu from the perturbing field, whilst 
making a transition to a final state whose energy level exceeds that of the initial 
state by rtcu. This process is known as absorption. In both cases, the total energy 
(i.e., that of the system plus the perturbing field) is conserved. 



By analogy with Eq. (6.221), 



2n 

Wi-,[n] = Y' Vni ' 2p ( En ) 

2n 



(6.233) 
(6.234) 

E n =Ei+Tio) 



Equation (6.233) specifies the transition rate for stimulated emission, whereas 
Eq. (6.234) gives the transition rate for absorption. These equations are more 
usually written 

w^u = — |V ni | 2 6(E n -Et + hcu), (6.235) 



w wn = -r l»ni 



Pi 

271 

IT 



V 1 1 li | 2 6(E n -E i - Tiao). (6.236) 



It is clear from Eqs. (6.228)-(6.229) that IV^I 2 = |V ni | 2 . It follows from 
Eqs. (6.233)-(6.234) that 

to. . , hi 

(6.237) 



P(E n ) P(Ei) ' 

In other words, the rate of stimulated emission, divided by the density of final 
states for stimulated emission, equals the rate of absorption, divided by the den- 
sity of final states for absorption. This result, which expresses a fundamental 
symmetry between absorption and stimulated emission, is known as detailed bal- 
ancing, and is very important in statistical mechanics. 



6.15 Absorption and stimulated emission of radiation 

Let us use some of the results of time-dependent perturbation theory to inves- 
tigate the interaction of an atomic electron with classical (i.e., non-quantized) 
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electromagnetic radiation. 

The unperturbed Hamiltonian is 

P 2 

H = ^ + V (r). (6.238) 
2m e 

The standard classical prescription for obtaining the Hamiltonian of a particle of 
charge q in the presence of an electromagnetic field is 

p -> p + qA, (6.239) 

H -> H-qc|), (6.240) 

where A(r) is the vector potential and 4>(r) is the scalar potential. Note that 

E = -V*-^, (6.241) 

B = V x A. (6.242) 

This prescription also works in quantum mechanics. Thus, the Hamiltonian of an 
atomic electron placed in an electromagnetic field is 

H = (P ~ gA] + e 4 + V (r) , (6.243) 
2m e 

where A and c|) are functions of the position operators. The above equation can 
be written 

fp 2 - eAp - ep A + e 2 A 2 ) , , _ 

H = ^ ^— " " + e * + Vo t • (6-244) 

2m e 

Now, 

pA = Ap, (6.245) 
provided that we adopt the gauge V A = 0. Hence, 

H = ^-- + + + V (t). (6.246) 
2 m e m e 2 m e 

Suppose that the perturbation corresponds to a monochromatic plane-wave, 
for which 

4 = 0, (6.247) 

A = 2A e cos^n-r- cutj , (6.248) 
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where e and n are unit vectors which specify the direction of polarization and 
the direction of propagation, respectively. Note that e n = 0. The Hamiltonian 
becomes 

H = H + H 1 (t), (6.249) 

with 

P 2 

H = ^- + V(r), (6.250) 
2m e 

and 

Hi ~ _ eA J 5 ) (6.251) 

m e 

where the A 2 term, which is second order in Ao, has been neglected. 
The perturbing Hamiltonian can be written 

Hi = - gA ° e ' P (exp[i(a)/c)n-r-ia)t] + exp[-i (cu/c) n r + itut]) . (6.252) 
m e 

This has the same form as Eq. (6.226), provided that 

V = - eA ° £ ' P exp[-i (cu/c) n-r] (6.253) 

It is clear, by analogy with the previous analysis, that the first term on the right- 
hand side of Eq. (6.252) describes the absorption of a photon of energy Picu, 
whereas the second term describes the stimulated emission of a photon of energy 
ho). It follows from Eq. (6.236) that the rate of absorption is 

27i e 2 t 7 

Wi^ n = — |Aol \{n\ exp[i (eu/c) n-r] e-p |i)| 6(E n — Ei — hcu). (6.254) 

Pi m/ 

The absorption cross-section is defined as the ratio of the power absorbed by 
the atom to the incident power per unit area in the electromagnetic field. Now 
the energy density of an electromagnetic field is 

U 1(*J£ + B£), (6.255) 
212 2(i / 
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where Eo and Bo = Eo/c = 2Aocu/c are the peak electric and magnetic field- 
strengths, respectively. The incident power per unit area of the electromagnetic 
field is 

cU = 2e ccu 2 |A | 2 . (6.256) 



Now, 



so 



Oabs = r. ' (6.257) 

cU 



7t e 2 7 

0abs = 1 I(ti| exp[i (o)/c) n r] e p |i)| 6(E n — E| — hcv). (6.258) 

eo m e z cu c 



6.16 The electric dipole approximation 

In general, the wave-length of the type of electromagnetic radiation which in- 
duces, or is emitted during, transitions between different atomic energy levels is 
much larger than the typical size of a light atom. Thus, 

exp[i(cu/c)n-r] = 1 +i — n-rH , (6.259) 

c 

can be approximated by its first term, unity (remember that cu/c = 2tt/A). This 
approximation is known as the electric dipole approximation. It follows that 

(rt| exp[i (cu/c) n-r] e-p |i) ~ e- (n|p|i). (6.260) 

It is readily demonstrated that 

[r,H ] = 1 - h ^, (6.261) 

m e 

so 

TTL 

(n|p|i) = — i — - (n| [r, H ] |i) = i m e cu ni (n|r|i) . (6.262) 
Using Eq. (6.258), we obtain 

Oabs = 47t 2 cxcu ni |(n|e-r|i}| 6(cu — cu ni ), (6.263) 
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where oc = e 2 /(2eo He) = 1/137 is the fine structure constant. It is clear that if 
the absorption cross-section is regarded as a function of the applied frequency, 
cu, then it exhibits a sharp maximum at cu = cu ni = (E n — Ei)/Ti. 

Suppose that the radiation is polarized in the z-direction, so that e = 2. We 
have already seen, from Sect. 6.4, that (n|z|i) = unless the initial and final 
states satisfy 

Al = ±1, (6.264) 
Am = 0. (6.265) 

Here, I is the quantum number describing the total orbital angular momentum 
of the electron, and m is the quantum number describing the projection of the 
orbital angular momentum along the z-axis. It is easily demonstrated that (n|x|i) 
and (n|y|i) are only non-zero if 

Al = ±1, 
Am = ±1. 

Thus, for generally directed radiation (n|e-r|i) is only non-zero if 

Al = ±1, (6.268) 
Am = 0,±1. (6.269) 

These are termed the selection rules for electric dipole transitions. It is clear, for 
instance, that the electric dipole approximation allows a transition from a 2p 
state to a Is state, but disallows a transition from a 2s to a Is state. The latter 
transition is called a forbidden transition. 

Forbidden transitions are not strictly forbidden. Instead, they take place at a 
far lower rate than transitions which are allowed according to the electric dipole 
approximation. After electric dipole transitions, the next most likely type of tran- 
sition is a magnetic dipole transition, which is due to the interaction between the 
electron spin and the oscillating magnetic field of the incident electromagnetic 
radiation. Magnetic dipole transitions are typically about 1 5 times more unlikely 
than similar electric dipole transitions. The first-order term in Eq. (6.259) yields 



(6.266) 
(6.267) 
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so-called electric quadrupole transitions. These are typically about 10 s times more 
unlikely than electric dipole transitions. Magnetic dipole and electric quadrupole 
transitions satisfy different selection rules than electric dipole transitions: for 
instance, the selection rules for electric quadrupole transitions are Al = 0,±2. 
Thus, transitions which are forbidden as electric dipole transitions may well be 
allowed as magnetic dipole or electric quadrupole transitions. 

Integrating Eq. (6.263) over all possible frequencies of the incident radiation 
yields 

cxcUrvi |(n|e-r|i)| . (6.270) 

n 

Suppose, for the sake of definiteness, that the incident radiation is polarized in 
the x-direction. It is easily demonstrated that 

Pi 2 

[x,[x,Ho]] = . (6.271) 

rru 



Thus, 



(i|[x, [x,H ]]|i> = (i|x 2 H + HoX 2 -2xH x|i) = , (6.272) 



giving 

Pi 2 

2 y~ ((i|x|n)Et(n|x|i) — (i|x|n)E n (n|x|i)) = . (6.273) 

n m e 

It follows that 

—A Y cu ni | (n|x|i) | 2 = 1 . (6.274) 

This is known as the Thomas-Reiche-Kuhn sum rule. According to this rule, 
Eq. (6.270) reduces to 

, , 1 2n 2 ocfi ne 2 

Oabslcuj dcv = = . (6.275) 

m e 2e m e c 

Note that Ti has dropped out of the final result. In fact, the above formula is ex- 
actly the same as that obtained classically by treating the electron as an oscillator. 
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6.17 Energy-shifts and decay-widths 



We have examined how a state |n), other than the initial state becomes popu- 
lated as a result of some time-dependent perturbation applied to the system. Let 
us now consider how the initial state becomes depopulated. 

It is convenient to gradually turn on the perturbation from zero at t = — oo. 
Thus, 

H](t) = exp(ut)H 1 , (6.276) 
where r\ is small and positive, and Hi is a constant. 

In the remote past, t — > — oo, the system is assumed to be in the initial state 
Thus, Ci(t — > — oo) = 1, and c n7 ^(t — > — oo) = 0. Basically, we want to calculate 
the time evolution of the coefficient Ci(t). First, however, let us check that our 
previous Fermi golden rule result still applies when the perturbing potential is 
turned on slowly, instead of very suddenly. For c n ^[i) we have from Eqs. (6.200)- 
(6.201) that 

cf(t) = 0, (6.277) 



"n 

c4%) = -^H ni rt 



exp[(r| + iu> ni )t'] dt' 



= _i Hni eXp[(l1 + ia '" )tl , (6.278) 
Fi n + l cunt 

where H ni = (n|Hi|i). It follows that, to first-order, the transition probability 
from state |i) to state |n) is 

Pi _(t) = Icl'ip = l>^^4. (6.279) 

The transition rate is given by 

dP^ n 2|H ni | 2 rjexp(2r|t) f , 1Qm 

Wwn(t) = —7— = —2 t- (6.280) 



TLX 



Consider the limit r| — > 0. In this limit, exp(r| t) — > 1, but 

ri^0rj z + cu 



lim^^ Y = n8{w ni ) = n Pi6(E n - EJ. (6.281) 
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Thus, Eq. (6.280) yields the standard Fermi golden rule result 

In 



w 



x— m 



Pi 



|H ni | z 6(E n -E t ). 



(6.282) 



It is clear that the delta-function in the above formula actually represents a func- 
tion which is highly peaked at some particular energy. The width of the peak is 
determined by how fast the perturbation is switched on. 



Let us now calculate Ct(t) using Eqs. (6.200)-(6.202). We have 



c! 0) ( t) = i, 



cl"(t) 



c! 2) (t} 



rt 



ft 

-V 

h 



expfntOdt^-iHu?^ 

rt n 



(6.283) 
(6.284) 



y~ lH-m.ll 



m 



dt' 



t' 



dt' 



x exp[(n + ieu im )t / ] exp[(n + iu> mi )t"], 



m 

Thus, to second-order we have 

Ci(t) : 



2 exp(2nt) 
nul 2n (u + ia> mi ) 



(6.285) 



H^H^+a IH 



+ 



-i\ 



r\ \TiJ 
H mi | 2 exp(2nt) 



exp(2n t) 



hj ^■ i 2n(E i -E m + iriri) 



(6.286) 



Let us now consider the ratio Ci/c\, where Ci = dci/dt. Using Eq. (6.286), we 
can evaluate this ratio in the limit r| — > 0. We obtain 



Ci 
Ci 



IH 



mil 



h) Hii+ (ri) n + ( F L ;^E i -E m + iFiTi 



i H 



XX 



W 1~| 



IH 



mil 



^) H « + ^o(ir)£E i -E m + i^ 
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This result is formally correct to second-order in perturbed quantities. Note that 
the right-hand side of Eq. (6.287) is independent of time. We can write 



At, (6.288) 



Ci = /-T 

Ci U/ 

where 

At = H tt + Km Y ^4 . - (6.289) 

is a constant. According to a well-known result in pure mathematics, 

lim — ] — = P - - i7t6(x), (6.290) 

e^O X + 1 e X 

where e > 0, and P denotes the principle part. It follows that 

At = H u + P Y_ } H ™t -inY_ |H mi | 2 5(Ei - E m ). (6.291) 

It is convenient to normalize the solution of Eq. (6.288) so that ct(0) = 1. 
Thus, we obtain 

d(t) = exp(^±y (6.292) 

According to Eq. (6.149), the time evolution of the initial state ket |i) is given by 

|i,t> = exp[-i(Ai + Ei)t/rt] |i). (6.293) 

We can rewrite this result as 

|i,t) = exp(-i[Ei + Re(Ai)]t/ri) exp[Im(Ai) t/Ti] |i>. (6.294) 

It is clear that the real part of A\ gives rise to a simple shift in energy of state 
whereas the imaginary part of A-t governs the growth or decay of this state. Thus, 



where 



|i, t) = exp[-i (Ei + AEi) t/h] exp{-Vi t/2h) |i) , (6.295) 



AE, = Re(Ai) = H ii + ?Y_ (6-296) 
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and 

r\ 2Im(Ai) 2ti^- 2 r)Q 7 \ 

. = =- = -v- > |H mi | z 6(Ei- E m J. (6.297) 

n, n Pi 

Note that the energy-shift AE| is the same as that predicted by standard time- 
independent perturbation theory. 

The probability of observing the system in state |i) at time t > 0, given that it 
is definately in state |i) at time t = 0, is given by 

P^t) = | Ci | 2 = exp(-Tit/Ti), (6.298) 

where 

Here, use has been made of Eq. (6.222). Clearly, the rate of decay of the initial 
state is a simple function of the transition rates to the other states. Note that the 
system conserves probability up to second-order in perturbed quantities, since 

Icil 2 + X>ml 2 - (1 -r t t/Ti)+^w l _ m t = 1. (6.300) 

The quantity A t is called the decay-width of state It is closely related to the 
mean lifetime of this state, 

Ti = (6.301) 

where 

Pi^i = exp(-t/Ti). (6.302) 

According to Eq. (6.294), the amplitude of state |i) both oscillates and decays as 
time progresses. Clearly, state |i) is not a stationary state in the presence of the 
time-dependent perturbation. However, we can still represent it as a superposi- 
tion of stationary states (whose amplitudes simply oscillate in time) . Thus, 



exp[-i (E t + AE0 t/Ti] exp(-rit/2Ti) 



f (E) exp(-i E t/Ti) dE, (6.303) 
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where f (E) is the weight of the stationary state with energy E in the superposition. 
The Fourier inversion theorem yields 

|f(E)|2 * w^w^wrm- (6 - 304) 

In the absence of the perturbation, |f (E)| 2 is basically a delta-function centred on 
the unperturbed energy E t of state |i) . In other words, state |i) is a stationary state 
whose energy is completely determined. In the presence of the perturbation, the 
energy of state |i) is shifted by Re(Ai). The fact that the state is no longer station- 
ary (i.e., it decays in time) implies that its energy cannot be exactly determined. 
Indeed, the energy of the state is smeared over some region of width (in energy) 
Vi centred around the shifted energy E t + Re( AO . The faster the decay of the state 
(i.e., the larger TO, the more its energy is spread out. This effect is clearly a man- 
ifestation of the energy-time uncertainty relation AE At ~ Pi. One consequence of 
this effect is the existence of a natural width of spectral lines associated with the 
decay of some excited state to the ground state (or any other lower energy state) . 
The uncertainty in energy of the excited state, due to its propensity to decay, gives 
rise to a slight smearing (in wave-length) of the spectral line associated with the 
transition. Strong lines, which correspond to fast transitions, are smeared out 
more that weak lines. For this reason, spectroscopists generally favour forbid- 
den lines for Doppler shift measurements. Such lines are not as bright as those 
corresponding to allowed transitions, but they are a lot sharper. 
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7 Scattering theory 
7.1 Introduction 

Historically, data regarding quantum phenomena has been obtained from two 
main sources — the study of spectroscopic lines, and scattering experiments. We 
have already developed theories which account for some aspects of the spectra of 
hydrogen-like atoms. Let us now examine the quantum theory of scattering. 



7.2 The Lipmann-Schwinger equation 

Consider time-independent scattering theory, for which the Hamiltonian of the 
system is written 

H = H + H 1 , (7.1) 
where Ho is the Hamiltonian of a free particle of mass m, 

P 2 

H = , (7.2) 
2m 

and Hi represents the non-time-varying source of the scattering. Let |4>) be an 
energy eigenket of H , 

Ho I4>) =E |4>), (7.3) 

whose wave-function (r'|cj)) is 4>(r')- This state is a plane-wave state or, possibly, 
a spherical-wave state. Schrodinger's equation for the scattering problem is 

(H + H 1 )|i|))=EW)), (7.4) 

where |ip) is an energy eigenstate of the total Hamiltonian whose wave-function 
(r'|xp) is iMr'). In general, both Ho and Ho + Hi have continuous energy spectra: 
i.e., their energy eigenstates are unbound. We require a solution of Eq. (7.4) 
which satisfies the boundary condition |ip) — > |4>) as Hi — > 0. Here, |cj>) is a 
solution of the free particle Schrodinger equation, (7.3), corresponding to the 
same energy eigenvalue. 
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Formally, the desired solution can be written 

1 



W>> = I*} + 



Hr 



HiN>>. 



(7.5) 



Note that we can recover Eq. (7.4) by operating on the above equation with 
E — Ho, and making use of Eq. (7.3). Furthermore, the solution satisfies the 
boundary condition — > |4>) as Hi — > 0. Unfortunately, the operator (E — Ho) 1 
is singular: i.e., it produces infinities when it operates on an eigenstate of Ho 
corresponding to the eigenvalue E. We need a prescription for dealing with these 
infinities, otherwise the above solution is useless. The standard prescription is to 
make the energy eigenvalue E slightly complex. Thus, 



1^ 



±\ 



l*> + 



1 



E - Ho ± i e 



Hifi[> 



(7.6) 



where e is real, positive, and small. Equation (7.6) is called the Lipmann-Schwinger 
equation, and is non-singular as long as e > 0. The physical significance of the ± 
signs will become apparent later on. 

The Lipmann-Schwinger equation can be converted into an integral equation 
via left multiplication by (r| . Thus, 



i|^(r) = cMr) + 



1 



E - Ho ± i e 



r'Wr'lHiN^) dV. 



(7.7) 



Adopting the Schrodinger representation, we can write the scattering problem 
(7.4) in the form 



(V z + k z )iKr) 



where 



" h 2 

h 2 k 2 
2 m ' 



(r|H,M>), 



(7.8) 



(7.9) 



This equation is called Helmholtz's equation, and can be inverted using standard 
Green's function techniques. Thus, 



Mr) =*(r) + T 2- 
n 



G^r'Xr'lHTlij)) dV, 



(7.10) 
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where 

(V 2 + k 2 )G(r,r / ) = 6(r-r'). (7.11) 

Note that the solution (7.10) satisfies the boundary condition — > |4>) as Hi — > 
0. As is well-known, the Green's function for the Helmholtz problem is given by 

exp(±ik|r — r'| ) 



An |r 



Thus, Eq. (7.10) becomes 



^(r) = 4>(r) - 



2m 

h 2 " 



exp(±ik|r — r'| ) 
47t|r — r'l 



(r'lHilip) dV. 



(7.12) 



(7.13) 



A comparison of Eqs. (7.7) and (7.13) suggests that the kernel to Eq. (7.7) takes 
the form 



1 



E-Hn±ie 



2m exp(±ik|r — v'\ ) 



(7.14) 



/ Ft 47t|r-r / | 
It is not entirely clear that the ± signs correspond on both sides of this equation. 
In fact, they do, as is easily proved by a more rigorous derivation of this result. 



(7.15) 



Let us suppose that the scattering Hamiltonian, Hi, is only a function of the 
position operators. This implies that 

(r'lH^r) = V(r)5(r-r'). 

We can write 

(r'lHili^) = (r / |H 1 |r // )(r // |ip ± )d 3 r // 
Thus, the integral equation (7.13) simplifies to 



(7.16) 



i|^(r) = cf>(r) 



2m 

h 2 " 



' exp(±ik|r — r'|) 
47r|r — r'l 



V(r')^(r') dV. 



(7.17) 



Suppose that the initial state |4>) is a plane-wave with wave-vector k (i.e., a 
stream of particles of definite momentum p = Pik). The ket corresponding to 
this state is denoted Ik). The associated wave-function takes the form 



rlk) 



exp( ik-r) 

(2tt) 3 / 2 



(7.18) 
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The wave-function is normalized such that 



(k|r)(r|k') d 6 r 

exp[-ir.(k-k-)] d3r = 6(k _ n (7 . 19) 

[2n) 6 

Suppose that the scattering potential V(r) is only non-zero in some relatively 
localized region centred on the origin (r = 0). Let us calculate the wave-function 
ip(r) a long way from the scattering region. In other words, let us adopt the 
ordering r ^> r'. It is easily demonstrated that 

|r-r'| ~r-*-r' (7.20) 

to first-order in r'/r, where 

f= r (7.21) 
r 

is a unit vector which points from the scattering region to the observation point. 
Let us define 

k' = kf . (7.22) 

Clearly k' is the wave-vector for particles which possess the same energy as the 
incoming particles (i.e., k' — k), but propagate from the scattering region to the 
observation point. Note that 

exp(±ik|r-r'|) ~ exp(±ikr) exp(^ik'-r'). (7.23) 



In the large-r limit, Eq. (7.17) reduces to 



exp(ik-r) m exp(±ikr) 

(271)3/2 ~^~p? " 



exp^ik'-r'jVM^MdV. (7.24) 



The first term on the right-hand side is the incident wave. The second term 
represents a spherical wave centred on the scattering region. The plus sign (on 
i|) ± ) corresponds to a wave propagating away from the scattering region, whereas 
the minus sign corresponds to a wave propagating towards the scattering region. 
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It is obvious that the former represents the physical solution. Thus, the wave- 
function a long way from the scattering region can be written 



iMr) 



1 



(27T) 3 /2 



, , exp( ikr) ,„ . . . 
exp( ik-r) H — -f (k', k) 



where 



f(k',k) = 



(27t) 2 m 



exp(— ik'-r') 
[2nW~ 2 



V(r , )iKr / )dV 



^WH,h|i>. 



(7.25) 



(7.26) 



Let us define the differential cross-section dcr/dO as the number of particles 
per unit time scattered into an element of solid angle dO, divided by the incident 
flux of particles. Recall, from Sect. 4, that the probability flux (i.e., the particle 
flux) associated with a wave-function ij> is 

j = -Im(i|/Vib). 



m 

Thus, the probability flux associated with the incident wave-function, 

exp( ik-r) 

(27l) 3 / 2 ' 



(7.27) 



(7.28) 



is 



J 



inci 



(27t) 3 m 

Likewise, the probability flux associated with the scattered wave-function, 

exp(ikr)f(k / ,k) 

(27r) 3 / 2 r ' 

is 



(7.29) 



(7.30) 



J scat 



Ft |f(k',k)| 2 
(27t) 3 m r 2 



kf. 



Now, 



da _ r 2 dD. liscatl 
— dO = — -^H 
dCL |j 



(7.31) 



(7.32) 



incil 
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giving 

-^ = |f(k / ,k)| 2 . (7.33) 
all 

Thus, Iffk'jk)) 2 gives the differential cross-section for particles with incident mo- 
mentum hk to be scattered into states whose momentum vectors are directed in 
a range of solid angles dO about fik'. Note that the scattered particles possess 
the same energy as the incoming particles (i.e., k' = k). This is always the case 
for scattering Hamiltonians of the form shown in Eq. (7.15). 



7.3 The Born approximation 

Equation (7.33) is not particularly useful, as it stands, because the quantity 
f(k',k) depends on the unknown ket Recall that ip(r) = (r|ip) is the solu- 
tion of the integral equation 

r e xp(-ik / -r / )V(r / )iKr / )dV, (7.34) 



m exp lkr 
27th r 



where 4>(r) is the wave-function of the incident state. According to the above 
equation the total wave-function is a superposition of the incident wave-function 
and lots of spherical-waves emitted from the scattering region. The strength of 
the spherical-wave emitted at a given point is proportional to the local value of 
the scattering potential, V, as well as the local value of the wave-function, ip. 

Suppose that the scattering is not particularly strong. In this case, it is reason- 
able to suppose that the total wave-function, ij>(r), does not differ substantially 
from the incident wave-function, 4>(r). Thus, we can obtain an expression for 
f (k',k) by making the substitution 

This is called the Born approximation. 
The Born approximation yields 

' exp [i (k - k') -r'] V(r') d¥. (7.36) 



f(k',k)~ — — 7 
2nh 2 
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Thus, f(k',k) is proportional to the Fourier transform of the scattering potential 
V(r) with respect to the wave-vector q = k — k\ 



For a spherically symmetric potential, 

exp( i q r' cos 9') V(r') r' 2 dr' sin 9' d9' dcj)', (7.37) 



f(k',k)~ m 



27th 2 
giving 



2 m r0 ° 
f^k)^-^ 
Pi q 



r , V(r / )sin(qr / )dr / . (7.38) 



Note that f (k', k) is just a function of q for a spherically symmetric potential. It is 
easily demonstrated that 

q = |k-k| =2ksin(9/2), (7.39) 

where 9 is the angle subtended between the vectors k and k'. In other words, 9 
is the angle of scattering. Recall that the vectors k and k' have the same length 
by energy conservation. 

Consider scattering by a Yukawa potential 

= Voexp(-M 
|j,r 

where Vo is a constant and 1 /u. measures the "range" of the potential. It follows 
from Eq. (7.38) that 

f(8) = - y n J 2 , (7.41) 



since 

f*00 



exp{-\ir') sin(q r') dr' = 2 _|_ q 2 - ^ 7 - 42 ^ 

Thus, in the Born approximation, the differential cross-section for scattering by a 
Yukawa potential is 

*1 ~ t 2 ^] 2 1 (7 43) 

da-^ru 2 ^^ [2k 2 (1 -cos9) + ^ 2 ] 2 ' U J 

given that 

q 2 = 4k 2 sin 2 (9/2) =2k 2 (l - cos 9). (7.44) 
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The Yukawa potential reduces to the familiar Coulomb potential as u. — > 0, 
provided that Vo/u. — > ZZ / e 2 /47teo. In this limit the Born differential cross- 
section becomes 

da /2mZZV\ ; 1 



da \ 4ne h z J 16k 4 sin 4 (9/2) 
Recall that Pik is equivalent to |p|, so the above equation can be rewritten 

da / ZZ'e 2 \ 2 1 
da ~ \167teoEj sin 4 (9/2)' 

where E = p 2 /2m is the kinetic energy of the incident particles. Equation (7.46) 
is the classical Rutherford scattering cross-section formula. 

The Born approximation is valid provided that ip(r) is not too different from 
4>(r) in the scattering region. It follows, from Eq. (7.17), that the condition for 
ip(r) ~ 4>(r) in the vicinity of r = is 



m 



* exp( ikr') 



2nh z 



r' 



V(r') dV 



«1. (7.47) 



Consider the special case of the Yukawa potential. At low energies, (i.e., k <C \x) 
we can replace exp( ikr') by unity, giving 

^ « 1 (7.48) 

as the condition for the validity of the Born approximation. The condition for the 
Yukawa potential to develop a bound state is 

2 ^ > 2.7, (7.49) 
Pi (J 7 

where Vo is negative. Thus, if the potential is strong enough to form a bound 
state then the Born approximation is likely to break down. In the high-k limit, 
Eq. (7.47) yields 

(7.50) 

Ft (xk 

This inequality becomes progressively easier to satisfy as k increases, implying 
that the Born approximation is more accurate at high incident particle energies. 
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7 A Partial waves 



We can assume, without loss of generality, that the incident wave-function is 
characterized by a wave-vector k which is aligned parallel to the z-axis. The 
scattered wave-function is characterized by a wave-vector k' which has the same 
magnitude as k, but, in general, points in a different direction. The direction of k' 
is specified by the polar angle (i.e., the angle subtended between the two wave- 
vectors), and an azimuthal angle (p about the z-axis. Equation (7.38) strongly 
suggests that for a spherically symmetric scattering potential [i.e., V(r) = V(r)] 
the scattering amplitude is a function of only: 

f(0,(p) =f(9). (7.51) 

It follows that neither the incident wave-function, 

exp(ikz) exp(ikrcos0) 
* r = c, u/2 = rT~Tv2 ' (7.52) 



nor the total wave-function, 
Mr) 1 



(2tt) 3 / 2 

depend on the azimuthal angle cp 



exp(ikr)f(0) 
exp(ikrcos0) H — —-^ 



(7.53) 



Outside the range of the scattering potential, both 4>(r) and ip(r) satisfy the 
free space Schrodinger equation 

(V 2 + k 2 )ip = 0. (7.54) 

What is the most general solution to this equation in spherical polar coordinates 
which does not depend on the azimuthal angle (p? Separation of variables yields 

i|)(T,e) = ^Ri(t)Pi(cos9), (7.55) 

i 

since the Legendre functions Pi.(cos0) form a complete set in -space. The Leg- 
endre functions are related to the spherical harmonics introduced in Sect. 5 via 



Pi(cos0) 



Y L o (0,(p). (7.56) 



\2l + 1 
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Equations (7.54) and (7.55) can be combined to give 

r 2 ^ + 2 + [k 2 r 2 - I (I + 1 )]R t = 0. (7.57) 
dr z dr 

The two independent solutions to this equation are called a spherical Bessel func- 
tion, ji(kr), and a Neumann function, r|i(kr). It is easily demonstrated that 

L / 1 d\ l sinu 



h(y) = -y l ( 



Tii(y) 



-y 



L / 1 d \ l cosy 



(7.58) 



(7.59) 



y dyj y 

Note that spherical Bessel functions are well-behaved in the limit y — > , whereas 
Neumann functions become singular. The asymptotic behaviour of these func- 
tions in the limit y — > oo is 

sin(u — Itt/2) 



m(y) 



y 

cos(u — l7t/2) 

y 



(7.60) 
(7.61) 



We can write 



exp(ikrcos0) = ^ ajifkr) P^cosG), 



(7.62) 



where the cl\ are constants. Note there are no Neumann functions in this expan- 
sion, because they are not well-behaved as r — > 0. The Legendre functions are 
orthonormal, 

Pn(M-) P m (u.) du. = 8 ™ , 
.-1 TL+1/2 

so we can invert the above expansion to give 



a l j l (kr) = (1+1/2) 
It is well-known that 



exp( ikr u.) P\[\i) d\i. 



-i 



h(v) 



M) 1 



exp(iu \i) Pi(u-) d(i, 



(7.63) 



(7.64) 



(7.65) 
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where I = 0, 1 ,2, • • • [see Abramowitz and Stegun (Dover, New York NY, 1965), 
Eq. 10.1.14]. Thus, 

a L = i l (2l+1), (7.66) 

giving 

exp(ikrcos9) = ^ i l (2 1 + 1 ) j L (kr) P^cos 9). (7.67) 

i 

The above expression tells us how to decompose a plane-wave into a series of 
spherical-waves (or "partial waves") . 

The most general solution for the total wave-function outside the scattering 
region is 

Mr) = j 2 ^ 2 ^[A l j l (kr) + B im (kr)]? l (cos9) } (7.68) 

where the A\ and B\ are constants. Note that the Neumann functions are allowed 
to appear in this expansion, because its region of validity does not include the 
origin. In the large-r limit, the total wave-function reduces to 



. sinfkr — ln/2) „ cosfkr — ln/2) 
kr kr 



Pt(cos9), (7.69) 



where use has been made of Eqs. (7.60)-(7.61). The above expression can also 
be written 

IM 1 v- sin(kr-l7t/2 + 60 

^ r -rT^V2/_ Cl t Pi cos , (7.70) 

{2nyi l *— kr 

where the sine and cosine functions have been combined to give a sine function 
which is phase-shifted by b\. 

Equation (7.70) yields 

s 1 n exp[i(kr-l 7 r/2 + 5 l )]-exp[-i(kr-l7r/2 + 5 l )] 

^ r ^ \vi Z_ Cl ^ 

[Xn) 5 ' 1 z — 2ikr 

xP L (cos9), (7.71) 

which contains both incoming and outgoing spherical-waves. What is the source 
of the incoming waves? Obviously, they must be part of the large-r asymptotic 
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expansion of the incident wave-function. In fact, it is easily seen that 

1 ^ exp[i(kr-l7t/2)] - exp[-i(kr- Itt/2)] 

* r - (~> vwi ) l 21+1 

xP L (cos0) (7.72) 
in the large-r limit. Now, Eqs. (7.52) and (7.53) give 

(27rp[^(r)-^(r)] = eXp(ikr) f(9). (7.73) 

r 

Note that the right-hand side consists only of an outgoing spherical wave. This 
implies that the coefficients of the incoming spherical waves in the large-r ex- 
pansions of ip(r) and 4>(r) must be equal. It follows from Eqs. (7.71) and (7.72) 
that 

C l = (2l+1) exp[i(6 l + l?t/2)]. (7.74) 
Thus, Eqs. (7.71)-(7.73) yield 



f(9) = ^(2l + 1) eXp [ l5 ^ sinStP^cose). (7.75) 

1=0 



Clearly, determining the scattering amplitude f (0) via a decomposition into par- 
tial waves (i.e., spherical-waves) is equivalent to determining the phase-shifts b\. 



7.5 The optical theorem 



The differential scattering cross-section du/dQ. is simply the modulus squared of 
the scattering amplitude f (0). The total cross-section is given by 



Ototal 



1 



f(0)| 2 dO 



odcp 



^ (2 1 + 1 )(2 1/ + 1 ) exp[ i (6 t - S v ] 



i i' 



x sin &i sin by ?\{ \x) ? v ( |i) , 



(7.76) 
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where \x = cos 0. It follows that 

" k 2 



O-total = ^ 2j 2 1 + 1 ) Sin2 5l ' (7 - 77) 



where use has been made of Eq. (7.63). A comparison of this result with Eq. (7.75) 
yields 

citotai = ^ Im [f (0)] , (7.78) 
k 

since Pi(1) = 1. This result is known as the optical theorem. It is a reflection 
of the fact that the very existence of scattering requires scattering in the forward 
(0 = 0) direction in order to interfere with the incident wave, and thereby reduce 
the probability current in this direction. 

It is usual to write 

oo 

O-total = Y- 0-1 ' ( 7 - 79 ^ 

1=0 

where 

01 = ^2 (21 + 1) sin 2 6 l (7.80) 

is the 1th partial cross-section: i.e., the contribution to the total cross-section from 
the 1th partial wave. Note that the maximum value for the 1th partial cross-section 
occurs when the phase-shift b\ takes the value n/2. 



7.6 Determination of phase-shifts 

Let us now consider how the phase-shifts b\ can be evaluated. Consider a spher- 
ically symmetric potential V(r) which vanishes for r > a, where a is termed 
the range of the potential. In the region r > a, the wave-function ip(r) satisfies 
the free-space Schrodinger equation (7.54). The most general solution which is 
consistent with no incoming spherical-waves is 

oo 

^ W = 7^372 Y i l (2 1 + 1 ) A 1 (t) PlIcos 0) , (7.81) 

(271)3/2 Z_ 
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where 

A t (r) =exp(i&0 [cos Sutler) - sin S^kr)] . (7.82) 

Note that Neumann functions are allowed to appear in the above expression, 
because its region of validity does not include the origin (where V ^ 0). The 
logarithmic derivative of the 1th radial wave-function Ai(r) just outside the range 
of the potential is given by 

cos6ij{(ka) — sin6 l ri l / (ka) 



(3 l+ = ka 



(7.83) 



cos6 l j l (ka) - sinSLn^ka) 
where j{(x) denotes dji(x)/dx, etc. The above equation can be inverted to give 

kaj{(ka) - (3 l+ j L (ka) 



tan &i = 



(7.84) 



kanaka) - (3 l+ rn(ka) ' 

Thus, the problem of determining the phase-shift 6 L is equivalent to that of ob- 
taining (3 l+ . 

The most general solution to Schrodinger's equation inside the range of the 
potential (r < a) which does not depend on the azimuthal angle (p is 



-i oo 

Mr) = Xi l (2l+1)Ri(r)Pi(cos9) ; 



where 



and 



Rl(T) = 



ui(r) 



d 2 u L 
dr 2 

The boundary condition 



+ 



k2 _2m v _l0 + 1) 
h 2 r 2 



MO) = o 



u L = 0. 



(7.85) 



(7.86) 



(7.87) 



(7.88) 



ensures that the radial wave-function is well-behaved at the origin. We can 
launch a well-behaved solution of the above equation from r = 0, integrate out 
to r = a, and form the logarithmic derivative 



Pi- = 



1 dim/r) 



(ui/r) dr 
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Since ip(r) and its first derivatives are necessarily continuous for physically ac- 
ceptible wave-functions, it follows that 

(3i+ = (3 L _. (7.90) 

The phase-shift b\ is obtainable from Eq. (7.84). 



7.7 Hard sphere scattering 

Let us test out this scheme using a particularly simple example. Consider scatter- 
ing by a hard sphere, for which the potential is infinite for r < a, and zero for 
r > a. It follows that ip(r) is zero in the region r < a, which implies that u\ = 
for all I. Thus, 

|3i = p l+ = oo, (7.91) 
for all I. It follows from Eq. (7.84) that 

tan5 l = — - — -. (7.92) 
mOca) 

Consider the 1 = partial wave, which is usually referred to as the s-wave. 
Equation (7.92) yields 

sin(ka)/ka 

tan 6 = n , = - tan k a, (7.93) 

— cos(ka)/ka 

where use has been made of Eqs. (7.58)-(7.59). It follows that 

6 = -ka. (7.94) 

The s-wave radial wave function is 

[coska sinkr — sink a coskr] 



Aq(t) = exp(— ika) 



kr 



sin[k(r-a)] 

= exp — ika) - . (7.95) 

kr 

The corresponding radial wave-function for the incident wave takes the form 

~ , . sinkr 

A t = • (7-96) 

kr 
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It is clear that the actual 1 = radial wave-function is similar to the incident 1 = 
wave-function, except that it is phase-shifted by k a. 

Let us consider the low and high energy asymptotic limits of tan b\. Low energy 
means ka < 1. In this regime, the spherical Bessel functions and Neumann 
functions reduce to: 

Wkr) . (7.97) 



(21-1)!! 
Tkr) 

where n!! = n (n — 2) (n — 4) • • • 1 . It follows that 



mflcT) ^ - n ,„ n+ V" , (7-98) 



— (k al 2l+1 

ta " Sl = (21+ 1) [(21- I)!!] 2 ' ° m 

It is clear that we can neglect 5i, with I > 0, with respect to 6q. In other words, 
at low energy only s-wave scattering (i.e., spherically symmetric scattering) is 
important. It follows from Eqs. (7.33), (7.75), and (7.94) that 

da sin 2 k a 
dO = k 2 

for k a <C 1 . Note that the total cross-section 

da 



a 2 (7.100) 



°"total 



da 



da = 47Ta z (7.101) 



is four times the geometric cross-section na 2 (i.e., the cross-section for classical 
particles bouncing off a hard sphere of radius a) . However, low energy scattering 
implies relatively long wave-lengths, so we do not expect to obtain the classical 
result in this limit. 

Consider the high energy limit ka > 1 . At high energies, all partial waves 
up to l max = k a contribute significantly to the scattering cross-section. It follows 
from Eq. (7.77) that 

Anr l max 

^1 = ^^(21+1) sin 2 6 L . (7.102) 

1=0 
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With so many I values contributing, it is legitimate to replace sin 5i by its average 
value 1 /2. Thus, 




ka 



o-totai =)" TT (2l + l)~27ra 2 . (7.103) 



This is twice the classical result, which is somewhat surprizing, since we might ex- 
pect to obtain the classical result in the short wave-length limit. For hard sphere 
scattering, incident waves with impact parameters less than a must be deflected. 
However, in order to produce a "shadow" behind the sphere, there must be scat- 
tering in the forward direction (recall the optical theorem) to produce destruc- 
tive interference with the incident plane-wave. In fact, the interference is not 
completely destructive, and the shadow has a bright spot in the forward direc- 
tion. The effective cross-section associated with this bright spot is 7ta 2 which, 
when combined with the cross-section for classical reflection, n a 2 , gives the ac- 
tual cross-section of 2n a 2 . 

7.8 Low energy scattering 

At low energies (i.e., when 1 /k is much larger than the range of the potential) 
partial waves with I > 0, in general, make a negligible contribution to the scatter- 
ing cross-section. It follows that, at these energies, with a finite range potential, 
only s-wave scattering is important. 

As a specific example, let us consider scattering by a finite potential well, char- 
acterized by V = Vo for r < a, and V = for r > a. Here, Vo is a constant. 
The potential is repulsive for Vo > 0, and attractive for Vo < 0. The outside 
wave-function is given by [see Eq. (7.82)] 



where use has been made of Eqs. (7.58)-(7.59). The inside wave-function follows 



Ao(r) 



exp( i 6 ) [jo Oct) cos 6 - Tlo(kr) sin 6 ] 
exp( i5o) sin(kr + So) 



(7.104) 



kr 



(7.105) 
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from Eq. (7.87). We obtain 



Ao(r) = B Sink,T , (7.106) 



where use has been made of the boundary condition (7.88). Here, B is a constant, 
and 

h 2 k /2 

E-V = - . (7.107) 

2m 

Note that Eq. (7.106) only applies when E > Vq. For E < Vo, we have 

Ao(r) = B (7.108) 
r 

where 

h 2 K 2 

V -E = — . (7.109) 
2m 

Matching Ao(r), and its radial derivative at r = a, yields 

k 

tan(ka + 5 ) = — tank' a (7.110) 
k' 

for E > Vo, and 

k 

tan(ka + 6 ) = - tanhKa (7.111) 

K 

for E < V . 

Consider an attractive potential, for which E > Vo. Suppose that \V \ > E 
(i.e., the depth of the potential well is much larger than the energy of the inci- 
dent particles), so that k' ;» k. It follows from Eq. (7.110) that, unless tank'a 
becomes extremely large, the right-hand side is much less that unity, so replacing 
the tangent of a small quantity with the quantity itself, we obtain 

k 

ka + 6 - r- tank'a. (7.112) 

k' 

This yields 

^kal^-n. (7 . 113 ) 
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According to Eq. (7.102), the scattering cross-section is given by 

2 



°"total 



4tt 



sin 6 = 4tt a 



2 / tan k'a 
\ k'a 



1 



Now 



k'a 



\ 



k 2 a 2 + 



2m|V |a 2 



(7.114) 



(7.115) 



so for sufficiently small values of k a, 



k'a 



\ 



2m|V |a 2 



(7.116) 



It follows that the total (s-wave) scattering cross-section is independent of the 
energy of the incident particles (provided that this energy is sufficiently small) . 

Note that there are values of k'a (e.g., k'a ~ 4.49) at which 6o — > n, an d the 
scattering cross-section (7.114) vanishes, despite the very strong attraction of the 
potential. In reality, the cross-section is not exactly zero, because of contributions 
from I > partial waves. But, at low incident energies, these contributions are 
small. It follows that there are certain values of Vo and k which give rise to almost 
perfect transmission of the incident wave. This is called the Ramsauer-Townsend 
effect, and has been observed experimentally. 



7.9 Resonances 



There is a significant exception to the independ ence of the cross-section on en- 
ergy. Suppose that the quantity ^2m|Vo| a 2 /h 2 is slightly less than n/2. As the 
incident energy increases, k'a, which is given by Eq. (7.115), can reach the value 
n/2. In this case, tan k'a becomes infinite, so we can no longer assume that the 
right-hand side of Eq. (7.110) is small. In fact, at the value of the incident energy 
when k'a = n/2, it follows from Eq. (7.110) that ka + So = tt/2, or §o — ti/2 
(since we are assuming that k a <C 1 ) . This implies that 

o-totai = ^sin 2 6 = 47ta 2 (^g) ■ (7.117) 
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Note that the cross-section now depends on the energy. Furthermore, the mag- 
nitude of the cross-section is much larger than that given in Eq. (7.114) for 
k'a^ n/2 (since k a <C 1 ) . 

The origin of this rather strange behaviour is quite simple. The condition 



2m|V |a 2 _7t 

M h 2 "I (7 - 118) 

is equivalent to the condition that a spherical well of depth Vo possesses a bound 
state at zero energy. Thus, for a potential well which satisfies the above equation, 
the energy of the scattering system is essentially the same as the energy of the 
bound state. In this situation, an incident particle would like to form a bound 
state in the potential well. However, the bound state is not stable, since the 
system has a small positive energy. Nevertheless, this sort of resonance scattering 
is best understood as the capture of an incident particle to form a metastable 
bound state, and the subsequent decay of the bound state and release of the 
particle. The cross-section for resonance scattering is generally far higher than 
that for non-resonance scattering. 

We have seen that there is a resonant effect when the phase-shift of the s-wave 
takes the value n/2. There is nothing special about the 1 = partial wave, so it 
is reasonable to assume that there is a similar resonance when the phase-shift of 
the 1th partial wave is n/2. Suppose that 6 L attains the value n/2 at the incident 
energy E , so that 

6i(E ) = ^. (7.119) 
Let us expand cot b\ in the vicinity of the resonant energy: 

cot5t(E) = cot6 l (E ) + f^^M (E-E ) + --- (7.120) 



Defining 




(7.121) 



(7.122) 



E=E 
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we obtain 

cotS^E) = ~(E-E ) + "-. (7.123) 

Recall, from Eq. (7.80), that the contribution of the 1th partial wave to the scat- 
tering cross-section is 

cn = % (2 1 + 1 ) sin^ 6l = % (1 1 + 1 ) TT ^. (7.124) 

4tt r 2 /4 
^F (21+1) ( e-e )W (7 - 125) 

This is the famous Breit-Wigner formula. The variation of the partial cross-section 
<Ji with the incident energy has the form of a classical resonance curve. The 
quantity V is the width of the resonance (in energy) . We can interpret the Breit- 
Wigner formula as describing the absorption of an incident particle to form a 
metastable state, of energy Eq, and lifetime t = h/V (see Sect. 6.17). 



Thus, 
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